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RECOMBINANT VECTOR SYSTEM FOR USING SAME AND 
RECOMBINANT-DNA METHOD FOR THE MANUFACTURE OF SAME 



BACKGROUND OF THE INVENTION 
ij This is a continuation -in-part application of U.S. Patent 

! j 

^Application Serial No. 699,181, filed February 5, 1985. 
\ Endogenous proteolytic enzymes serve to degrade invading organ- 
• isms, antigen-antibody complexes and certain tissue proteins 
: which are no longer necessary or useful to the organism. In a 
normally functioning organism, proteolytic enzymes are produced 
in a limited quantity and are regulated in part through specific 
inhibitors. 

Metalloproteinases are enzymes present in the body which are 
often involved in the degradation of connective tissue. While 
'gj some connective tissue degradation is necessary for normal func- 

jjjjj tioning of an organism, an excess of connective tissue degrada- 

tion occurs in several disease states and is believed to be at- 
tributable, at least in part, to excess metalloproteinase. It is 
believed that metalloproteinases are at least implicated in peri- 
odontal disease, corneal and skin ulcers, rheumatoid arthritis 
and the spread of cancerous solid tumors. 

These diseases generally occur in areas of the body which 
contain a high proportion of collagen, a particular form of con- 
nective tissue. An examination of patients with these diseases 
of connective tissue has revealed an excessive breakdown of the 
various components of connective tissues, including collagen 
proteoglycans and elastin. Therefore, it has been deduced that 
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an excessive concentration of a particular metalloproteinase , fc 
example" collagenase, proteoglyconuse, gelatinase, and certain 
elastases, - may cause or exacerbate the connective tissue destruc- 
tion associated with the aforementioned diseases. 
»; In the normal state, the body possesses metalloproteinase 

inhibitors which bind to metalloproteinases to effectively pre- 

i 

vent these enzymes from acting on their connective tissue sub- 
: j strates. Specifically, ina healthy organism, metalloproteinase 
.; inhibitors are present in concentrations sufficient to interact 
;with metalloproteinases to an -extent which allows sufficient 

quantities of metalloproteinase to remain active while binding 

0: the excess metalloproteinase so that the connective tissue damage 

MS 

seen in the various diseases does not occur. 

£ j 

%4 It is postulated that one immediate cause of the connective 

m 

tissue destruction present in the foregoing disease states is an 
imbalance in the relative metalloproteinase/metalloproteinase in- 
hibitor concentrations. In these situations, either due to an 
excessive amount of active metalloproteinase or a deficiency in 



the amount of active metalloproteinase inhibitor, the excess met- 
alloproteinase is believed to cause the connective tissue degra- 
dation responsible for causing or exacerbating the disease. It 
is postulated that, by treating persons with connective tissue 
diseases with metalloproteinase inhibitors, the degradative 
action of the excess metalloproteinase may be curtailed or 
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! ! halted. Therefore, particular metalloproteinase inhibitors of 
'specific interest to the present inventors are collagenase inhib- 
, itors because it is believed that these inhibitors would be phar- 
maceutical^ useful in the treatment or prevention of connective 
tissue diseases. 

The existence of metalloproteinase and metalloproteinase in- 
hibitors has been discussed in the scientific literature. For 
example, Sellers et al. , Biochemical And Biophysical Research 
Communications 87:581-587 (1979) , discusses isolation of rabbit 
bone collagenase inhibitor. Collagenase inhibitor isolated from 
human skin fibroblasts is discussed in Stricklin and Welgus, 
J. B.C. 258:12252-12258 (1983) and Welgus and Stricklin, J. B.C. 
258:12259-12264 <1983). The presence of collagenase inhibitors 
in naturally-occurring body fluids is further discussed in Murphy 
et al. , Biochem. J. 195:167-170 (1981) and Cawston et al. , 
Arthritis and Rheumatism, 27:285 (1984). In addition, metallo- 
M. proteinase inhibitors are discussed by Reynolds et al. in 

Cellular Interactions , Dingle and Gordon, eds., (1981). Although 
these articles characterize particular, isolated metallopro-; 
teinase inhibitors and discuss, to some extent, the role or 
potential role of metalloprote inases in connective tissue disease 
treatment and speculate on the ability of metalloproteinase 
inhibitors to counteract this destruction, none of these re- 
searchers had previously been able to isolate a portable DNA 
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sequence capable of directing intracellular production of metal- 
loproteinase inhibitors or to create a recombinant-DNA method fc 
j J the production of these inhibitors. 

jj Surprisingly, the present inventors/ have discovered a porta^ 

ble DNA sequence capable of direct ingythe recombinant-DNA synthe- 
jjsis of metalloproteinase inhibitors/ These metalloproteinase in- 
hibitors are biologically equivalent to those isolated from humar 
; skin fibroblast cultures. The rptetalloproteinase inhibitors of 
,the present invention, prepared by the recombinant-DNA methods 
: set forth herein, will enab/e increased research into prevention 
and treatment of metalloproteinase- induced connective tissue dis- 
eases. In addition, the metalloproteinase inhibitors of the 
present invention are/ useful in neutralizing metalloproleinases , 
including the excels metalloproteinase associated with disease 
states. Therefor^, it is believed that a cure for these diseases 
will be develop/d which will embody, as an active ingredient, the 
metalloproteiirfase inhibitors of the present invention. Further- 
more, the mexalloproteinase inhibitors of the present invention 
are capabl/e of interacting with their metalloproteinase targets 
in a manner which allows the development of diagnostic tests for 
degrad^tive connective tissue diseases using the newly discovered 
inhibitors . 

The recombinant metalloproteinase inhibitors discussed here- 
in interact stoichiometr ically (i.e., in a 1:1 ratio) with their 
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metalloproteinase targets. In addition, these metalloprote inase 
i inhibitors are heat resistant, acid stable, glycosylated, and ex- 
jhibit a high isoelectric point. 



^ sequences. 



SUMMARY OF THE INVENTION 

i 

.; The present invention relates to metalloproteinase inhib- 

; ;itors and a recombinant-DNA method of producing the same and to 
portable DNA sequences capable of directing intracellular produc- 
tion of the metalloproteinase inhibitors. Particularly, the 
present invention relates to a collagenase inhibitor, a recombi- 
nant-DNA method for producing the same and to portable DNA se- 
quences for use in the recombinant method. The present invention 
also relates to a series of vectors containing these portable DNA 



One object of the present invention is to provide a metallo- 
proteinase inhibitor, which can be produced in sufficient quan- 
tities and purities to provide economical pharmaceutical composi- 



l»* tions which possess metalloproteinase inhibitor activity. 



M An additional object of the present invention is to provide 

a recombinant-DNA method for the production of these metallopro- 
teinase inhibitors. The recombinant metalloproteinase inhibitors 
produced by this method are biologically equivalent to the metal- 
loproteinase inhibitor isolable from human skin fibroblast cul- 
tures . 
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To facilitate the recombinant-DNA synthesis of these metal- 
loproteinas inhibitors, it is a further object of the present 
invention to provide portable DNA sequences capable of directing 
intracellular production of metalloproteinase inhibitors. It is 
also an object of the present invention to provide cloning 
vectors containing these portable sequences. These vectors are 
capable of being used in recombinant systems to produce pharma- 

liceutically useful quantities of metalloproteinase inhibitors. 

; i 

Additional objects and advantages of the invention will be 
set forth in part in the description which follows, and in rart 
will be obvious from the description or may be learned from prac 

i 

!tice of the invention. The objects and advantages may be real- 

! ized and attained by means of tne instrumentalities and combina- 
tions particularly pointed out in the appended claims. 

To achieve the objects and in accordance with the purposes 
of the present invention, metalloproteinase inhibitors are set 
forth, which are capable of stoichiometric reaction with metallo 
proteinases. These metalloproteinase inhibitors are remarkably 
heat resistant, acid stable, glycosylated, and exhibit a high 
isoelectric point. Furthermore, these metalloproteinase inhib- 

j itors are biologically equivalent to those inhibitors isolated 

; f rom human skin fibroblast cultures. 

To further achieve the objects and in accordance with the 
purposes of the present invention, as embodied and broadly 



law ornccs 

Finnecan. Henderson 
Farabcw Garrett 
d Dunner 

1773 H STftCCT. N . W. 
WASHINGTON . O.C. IOOO« 



-6- 



w 

in 



desc^^ed herein, portable DNA sequ^Rs coding for metallopro- 
teinas* inhibitors are provided. These sequences comprise nucle- 
otid sequences capable of directing intracellular production of 
metalloproteinase inhibitors. The portable sequences may be ei- 
|! ther synthetic sequences or restriction fragments ("natural" DNA 
! sequences). In a preferred embodiment, a portable DNA sequence 
:is isolated from a human fibroblast cDNA library and is capable 
ii «f directing intracellular production of a collagenase inhibitor 
; j which is biologically equivalent to that inhibitor which is 
:! isolable from a human skin fibroblast culture. 

The coding st/and of a first preferred DNA sequence which 
^has been discove/ed has the following nucleotide sequence: 

i 

i 10 20 30 40 50 60 

; GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT GTGTCCCACC CCACCCACAG 

80 90 100 HO 120 

GCAATTCCGA CCTCGTCATC AGGGCCAAGT TCGTGGGGAC ACCAGAAGTC 

140 150 160 170 180 

CCTTATACCA GCGTTATGAG ATCAAGATGA CCAAGATGTA TAAAGGGTTC 

200 210 220 230 240 

GGGATGCCGC TGACATCCGG TTCGTCTACA CCCCCGCCAT GGAGAGTGTC 



il^^e 



70 

ACGGCCTTCT 
130 

AACCAGACCA 
190 

CAAGCCTTAG 
250 

TGCGGATACT 

; 310 

; CAGGATGGAC 
370 

; TTAGCTCAGC 
430 

• TTTCCCTGTT 



290 



300 



260 270 280 

TCCACAGGTC CCACAACCGC AGCGAGGAGT TTCTCATTGC TGGAAAACTG 

320 330 340 350 360 

TCTTGCACAT CACTACCTGC AGTTTCGTGG CTCCCTGGAA CAGCCTGAGC 

380 390 400 410 420 

GCCGGGGCTT CACCAAGACC TACACTGTTG GCTGTGAGGA ATGCACAGTG 

440 450 460 470 480 

TATCCATCCC CTGCAAACTG CAGAGTGGCA CTCATTGCTT GTGGACGGAC 
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a an 500 ?ln W 520 530 

CAGCTCCTCC AAGGCTCTGA AAAGGGCTTC CAGTCCCGTC ACCTTGCCTG CCTGCCTCu* 

ccn 560 570 580 590 6C 

GAGCCAGGGG TGTGCACCTG GCAGTCCCTG CGGTCCCAGA TAGCCTGAAT CCTGCCCGG 

cm 620 630 640 650 66 

GTGGAAGCTG AAGCCTGCAC AGTGTCCACC CTGTTCCCAC TCCCATCTTT CTTCCGGAC 

670 680 690 700 

ATGAAATAAA GAGTTACCAC CCAGCAAAAA AAAAAAGGAA TTC 

The nucleotides represented by the foregoing abbreviations are 
set forth in the Detailed Description of the Preferred Embodi- 
ments. 

A second preferrred Dn/ sequence has been discovered which 
has an additional nucleoside sequence 5' to the initiator se- 
quence. This sequenc/ which contains as the eighty-second 
through four-hundr^-thirty-second nucleotides nucleotoides 1 
through 351 of J^e first preferred sequence set forth above, has 
the followino/nucleotide sequence: 

in 20 30 40 50 i 

GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAGAG AACCCACCAT GGCCCCCT 

on 1 90 100 HO 1 

GACCCCTGGC TTCTGCATCC TGTTGTTGCT GTGGCTGATA GCCCCAGCAG GGCCTGCA, 

,,- , 40 150 ' 160 170 1' 

TGTGTCCCAC CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCAT CAGGGCCA. 

;ttcgtgggga caccagaag? caaccagacc accttatacc agcgtta?ga gatcaaga 

ACCAAGATG? ATAAAGGg" CCAAGCCTTA GGGGATGCM CTGACATCCG GTTCGTCT 
ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGG? CCCACAACCG CAGCGAGG 



utwr orrtccs 

Finnecan. Henderson _ g _ 

Farabow. Garrett 

8 DUSTMEFl 

|7T» N STMCCT.N. W. 
w*ftnit40TOM. O. C.IOOO* 



U1 




370 380 390 400 410 42 

TTTCTCATTG CTGGAAAACT GCAGGATGGA CTCTTGCACA TCACTACCTG CAGTTTCGT 

430 

GCTCCCTGGA AC 

A third preferred^ DNA sequence which incorporates the 5' re 
tlgion of the second .preferred sequence and the 3* sequence of the 
ij first preferred sequence, has the following nucleotide sequence: 

i • 

10 20 30 40 50 6 

GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAGAG AACCCACCAT GGCCCCCTT 



i 

' 70 80 90 100 110 12 

GACCCCTGGC TTCTGCATCC TGTTGTTGCT GTGGCTGATA GCCCCAGCAG GGCCTGCAC 

i 130 140 150 160 170 18 

TGTGTCCCAC CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCAT CAGGGCCAA 

190 200 210 220 230 24 

TTCGTGGGGA CACCAGAAGT CAACCAGACC ACCTTATACC AGCGTTATGA GATCAAGAT 

! 250 260 270 280 290 30 

ACCAAGATGT ATAAAGGGTT CCAAGCCTTA GGGGATGCCG CTGACATCCG GTTCGTCTA 

310 320 330 340 350 36 

ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGGT CCCACAACCG CAGCGAGGA 

370 380 390 400 410 42 

TTTCTCATTG CTGGAAAACT GCAGGATGGA CTCTTGCACA TCACTACCTG CAGTTTCGT 

430 440 450 460 470 48 

GCTCCCTGGA ACAGCCTGAG CTTAGCTCAG CGCCGGGGCT TCACCAAGAC CTACACTGT 

490 500 510 520 530 54 

GGCTGTGAGG AATGCACAGT GTTTCCCTGT TTATCCATCC CCTGCAAACT GCAGAGTGG 

,| 550 560 570 580 590 6C 

;!actcattgct TGTGGACGGA CCAGCTCCTC caaggctctg AAAAGGGCTT CCAGTCCCC 

! 610 620 630 640 650 66 

'cACCTTGCCT GCCTGCCTCG GGAGCCAGGG CTGTGCACCT GGCAGTCCCT GCGGTCCCJ 

670 680 690 700 . 710 7: 

ATAGCCTGAA TCCTGCCCGG AGTGGAAGCT GAAGCCTGCA CAGTGTCCAC CCTGTTCCC 
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730 740 750 760 770 780 

ZTCCCATCTT TCTTCCGGAC AATGAAATAA AGAGTTACCA CCCAGCAAAA AAAAAAAGGA 
To facilitate identification and isolation of natural DNA 

sequences for use in the present invention, the inventors have 
developed a human skin fibroblast cDNA library. This library 
contains the genetic information capable of directing a cell to 
synthesize the metalloproteinase inhibitors of the present inven- 
tion. Other natural DNA sequences which may be used in the 
i 

irecombinant DNA methods set forth herein may be isolated from 

I 

human genomic libraries. 

J Additionally, portable DNA sequences useful in the processes 

of the present invention may be synthetically created. These 

j - 

synthetic DNA sequences may be prepared by polynucleotide synthe- 

sis and sequencing techniques known to those of ordinary skill in 

i 

the art. 

Additionally, to achieve the objects and in accordance with 
jthe purposes of the present invention, a recombinant-DNA method 
is disclosed which results in microbial manufacture of the in- 
stant metalloproteinase inhibitors using the portable DNA se- 
quences referred to above. This recombinant DNA method com- 
prises : 

(a) preparation of a portable DNA sequence capable of 

i 

i directing a host microorganism to produce a protein 

having metalloproteinase inhibitor activity, preferably 
collagenase inhibitor activity; 
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(b) cloning the portable DNA sequence into a vector capable 
of being transferred into and replicating in a host mi- 
croorganism, such vector containing operational ele- 
ments for the portable DNA sequence; 

(c) transferring the vector containing the portable DNA se- 
quence and operational elements into a host microorga- 
nism capable of expressing the metalloproteinase inhib- 
itor protein; 

(d) culturing the host microorganism under conditions 
appropriate for amplification of the vector and expres- 
sion of the inhibitor; and 

(e) in either order: 

(i) harvesting the inhibitor; and 

(ii) causing the inhibitor to assume an active, terti- 
ary structure whereby it possesses metallopro- 
teinase inhibitor activity. 

To further accomplish the objects and in further accord with 
the purposes of the present invention, a series of cloning vec- 
tors are provided comprising at least one of the portable DNA se- 
quences discussed above. In particular, plasmid pUC9-F5/237P10 
is disclosed. 

It is understood that both the foregoing general description 
and the following detailed description are exemplary and explana- 
tory only and are not restrictive of the invention, as claimed. 
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The accompanying drawing, which is incorporated in and con- 
.'stitutes a part of this specification, illustrates one embodimen 

. i 

;!of th invention and, together with the description, serves to 
explain the principles of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a partial restriction map of the plasmid 
pUC9-F5/237P10. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Reference will now be made in detail to the presently pre- 
ferred embodiments of the invention, which, together with the 
drawing and the following examples, serve to explain the princi- 
ples of the invention. 



^ • ... 

£0 As noted above, the present invention relates in part to 



portable DNA sequences capable of directing intracellular produc- 
tion of metalloproteinase inhibitors in a variety of host micro- 
organisms. "Portable DNA sequence" in this context is intended 
^ to refer either to a synthetically-produced nucleotide sequence 

4* or to a restriction fragment of a naturally occuring DNA se- 

M« quence. For purposes of this specification, "metalloproteinase 

inhibitor" is intended to mean the primary structure of the pro- 
tein as defined by the codons present in the deoxyribonucleic 
acid sequence which directs intracellular production of the amine 
acid sequence, and which may or may not include post- trans la- 
tional modifications. It is contemplated that such 
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post-translat ional modifications include, for example, 
glycosylat ion. It is further intended that the term "metallopro 
teinase inhibitor" refer to either the form of the protein as 
would be excreted from a microorganism or the methionyl-metallo- 
:| proteinase inhibitor as- it may be present in microorganisms from 

ij which it was not excreted. 

■1 

;j In a preferred embodiment, the portable DNA sequences are 

:| capable of directing intracellular production of collagenase in- 

■ I 

ihibitors. In a particularly preferred embodiment, the portable 
; DNA sequences are capable of directing intracellular production 
i of a collagenase inhibitor biologically equivalent to that previ 
ously isolated from human skin fibroblast cultures. By "biologi 
cally equivalent", as used herein in the specification and 
claims, it is meant that an inhibitor, produced using a portable 
DNA sequence of the present invention, is capable of preventing 
collagenase- induced tissue damage of the same type, but not nec- 
essarily to the same degree, as a native human collagenase inhib 
itor, specifically that native human collagenase inhibitor 
isolable from human skin fibroblast cell cultures. 

A first preferred /portable DNA sequence of the present 
invention has a nucleotide sequence as follows: 
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20 


30 


40 


50 


60 


GTTGTTGCTG 


TGGCTGATAG 


CCCCAGCAGG 


GCCTGCACCT 


GTGTCCCACC 


CCACCCACAG 


70 


80 


A A 

90 


i a a 

100 


110 


120 


ACGGCCTTCT 


GCAATTCCGA 


CCTCGTCATC 


AGGGCCAAGT 


TCGTGGGGAC 


ACCAGAAGTC 


130 


140 


150 


160 


170 


180 


AACCAGACCA 


CCTTATACCA 


GCGTTATGAG 


ATCAAGATGA 


CCAAGATGTA 


TAAAGGGTTC 


190 


200 


210 


220 


230 


AAA 

240 


CAAGCCTTAG 


GGGATGCCGC 


TGACATCCGG 


TTCGTCTACA 


CCCCCGCCAT 


GGAGAGTGTC 


250 


260 


270 


280 


290 


"I A 

300 


TGCGGATACT 


TCCACAGGTC 


CCACAACCGC 


AGCGAGGAGT 


TTCTCATTGC 


TGGAAAACTG 


310 


320 


330 


340 


350 


360 


CAGGATGGAC 


TCTTGCACAT 


CACTACCTGC 


AGTTTCGTGG 


CTCCCTGGAA 


CAGCCTGAGC 


370 


380 


390 


400 


A 1 A 

410 


J A A 

4 20 


TTAGCTCAGC 


GCCGGGGCTT 


CACCAAGACC 


TACACTGTTG 


GCTGTGAGGA 


ATGCACAGTG 


430 


440 


450 


M r A 

460 


470 


A Q n 

480 


TTTCCCTGTT 

111 Www 1 W * * 


TATCCATCCC 


CTGCAAACTG 


CAGAGTGGCA 


CTCATTGCTT 


GTGGACGGAC 


490 


500 


510 


520 


e ^ a 

, 530 


C A A 
54 U 


CAGCTCCTCC 

wAww 1 WW 1 WW 


AAGGCTCTGA 


AAAGGGCTTC 


CAGTCCCGTC 


ACCTTGCCTG 


CCTGCCTCGG 


550 


560 


570 


580 


A A 

590 


600 


GAGCCAGGGC 


TGTGCACCTG 


GCAGTCCCTG 


CGGTCCCAGA 


TAGCCTGAAT 


CCTGCCCGGA 


610 


620 


630 


C A A 

640 


C CO 


oou 


GTGGAAGCTG 


AAGCCTGCAC 


AGTGTCCACC 


L, 1 vj I 1 LLLAL 


1 www/\ Hill 




670 


680 


690 


700 






ATGAAATAAA 


GAGTTACCAC 


CCAGCAAAAA 


AAAAAAGGAA 


TTC 





wherein the following nucleotides are represented by the abbrevi- 
ations indicated below. 
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'gleotides 
Deoxyadenylic acid 
Deoxyguanylic acid 
Deoxycytidylic acid 
^hymidylic acid 1 
'^Tsecond preferred/portable DMA sequence of the present 



eviat ion 
A 
G 

C 



Invention has the following nucleotide sequence: 




GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAgIS AACCCACCA? GGCCCCCTT? 
GAC'CCCTGGt TfCTGCATCC TGTTGTTGCT GTGGCTGAT'A GCCCCAGCAG GGCCTGCACC 
1 TGTGTCfcCA'C CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCA? CAGGGCCAAG 
TTCGTGGGGA CACCAGAAG^ CAACCAGACC ACCTTATACC AGCGTTA^GA GATCAAGAtS 
' ACCAAGATdT ATAAAGGGTT CCAAGCCTTA GGGGATGCCG CTGACATCCG GTTCGTCTAC 
ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGGT CCCACAACCG CAGCGAGGAG 
TTTCTCATTG CTGGAAAAcS GCAGGATG^A CTCTTGcEJ TCACTACC^ CAGTTTcSS 
430 

GCTCCCTGGA AC 

in this second preferred sequence, an open reading frame exists 
from nucleotides 1 throu/h 432. The first methionine of this 
reading frame is encod/ by nucleotides by 49 through 5! and i. 
the site of translat/n initiation. It should be noted that the 
amino acid sequence/ prescribed nucleotides 49 through 114 is not 



FlNNECAN. HENDEWON 

FaRabow. Garrett 

& DUNNER. 

»TT» » STUCCT.M.W. 

wAS*mCTO*.oc»ooo« 



-15- 




CO 



Si 



14: 



founofcn the mature meta^oproteina It is believed that this 
sequence is the leade^/peptide of the human protein. 

A thi^a" preferred portable DNA sequence has the mucleotide 

sequence: 

in 20 30 40 . 50 60 

GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAGAG AACCCACCAT GGCCCCCTTT 

_ n SQ 9 o 100 HO I 20 

GACCCCTGGC TTCTGCATCC TGTTGTTGCT GTGGCTGATA GCCCCAGCAG GGCCTGCACC 

nn 140 150 160 170 180 

TGTGTCCCAC CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCAT CAGGGCCAAG 

ion 200 210 220 230 240 

TTCGTGGGGA CACCAGAAGT CAACCAGACC ACCTTATACC AGCGTTATGA GATCAAGATG 

«n 260 270 280 290 300 

ACCAAGATGT ATAAAGGGtS CCAAGCCTTA GGGGATGCCG CTGACATCCG GTTCGTCTAC 

ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGC? CCCACAACCG CAGCGAGGAG 
' , fln ion 400 410 420 

TTTCTCATTG CTGGAAAAC? GCAGGATGGA CTCTTGCACA TCACTACCTG CAGTTTCGTG 

GCTCCCTGGA ACAGCCTGAG CTTAGCTCAG CGCCGGGg" TCACCAAGAc' CTACACTG?T 

GGCTGTGAGG AATGCACAGT GTTTCCcIg? TTATCCATO CCTGCAAACT GCAGAGTGGC 

actcattgI? tgtggacIga ccagctcItc caaggctctg aaaaggg"? ccagtcc"? 
caccttgcc? gcctgcc"g ggagccag^ ctgtgcac" ggcagtccc? gcggtcc"g 
atagcctIIa tcctgcccgg agtggaag?? gaagcctka cagtgtccac cctgttcccS 
ctcccatct? tcttccggac aatgaaa^aa agagttacca cccagcaIIa aaaaaaagg, 
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•i This third sequence contains the S^fontranslated r gion of t.ne 
: second preferred sequence and the 3' region of the first pre- 
;| ferred sequence. It is envisioned that this third preferred se- 
quence is capable of directing intracellular production of a met- 
alloproteinase analogous to a mature human collagenase inhibitor 

; i 

, in a microbial or mammalian expression system. 

It must be borne in mind in the practice of the present 
.; invention that the alteration of some amino acids in a protein 
.'sequence may not affect the fundamental properties of the pro- 
tein. Therefore, it is also contemplated that other portable DN> 
sequences, both those capable of directing intracellular produc- 
tion of identical amino acid sequences and those capable of 
directing intracellular production of analogous amino acid se- 
quences which also possess metalloproteinase inhibitor activity, 
are included within the ambit of the present invention. 

It is contemplated that some of these analogous amino acid 
sequences will be substantially homologous to native human 
metalloproteinase inhibitors while other amino acid sequences, 
capable of functioning as metalloproteinase inhibitors, will not 
exhibit substantial homology to native inhibitors. By "substan- 
tial homology", as used herein, is meant a degree of homology to 
a native metalloproteinase inhibitor in excess of 50%, pref erabl 
in excess of 60%, preferably in excess of 80%. The percentage 
homology as discussed herein is calculated as the percentage of 
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j amino acid rescues found in the smaller of the two sequences 
; that align vitn identical amino acid residues in the sequence 
|| being compared when four gaps in a length of 100 amino acids may 
'be introduced to assist in that alignment as set forth by 
; Dayhoff, M.O. in Atlas of Protein Sequence and Structure Vol. 5, 
!p. 124 (1972), National Biochemical Research Foundation, 
Washington, D.C. , specifically incorporated herein by reference. 
:| As noted above, the portable DNA sequences of the present 

i i 

■invention may be synthetically created. It is believed that the 
■means for synthetic creation of these polynucleotide sequences 
are generally known to one of ordinary skill in the art, particu- 
0 larly in light of the teachings contained herein. As an example 

f$ of the current state of the art relating to polynucleotide syn- 

iijl thesis, one is directed to Matteucci, M.D. and Caruthers, M.H., 

d in J. Am. Chem. Soc. 103 : 3185 (1981) and Beaucage, S.L. and 

Caruthers, M.H. in Tetrahedron Lett. 22: 1859 (1981), specifical- 
ly ly incorporated herein by reference. 

^; Additionally, the portable DNA sequence may be a fragment of 

^ a natural sequence, i.e., a fragment of a polynucleotide which 

occurred in nature and which has been isolated and purified for 
the first time by the present inventors. In one embodiment, the 
portable DNA sequence is a restriction fragment isolated from a 
cDNA library. In this preferred embodiment, the cDNA library is 
created from human skin fibroblasts. 
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In an alternative embodiment, the portable DNA sequence is. 
isolated from a human yenomic library. An example of such a li- 
brary useful in this embodiment is set forth in Lawn et al . Cell 
15 ; 1157-1174 (1978), specifically incorporated herein by refer 
ij ence. 

|| As also noted above, the present invention relates to a se- 

ll 

i! ries of vectors, each containing at least one of the portable DN. 
! | sequences described herein. It is contemplated that additional 
copies of the portable DNA sequence may be included in a single 
•vector to increase a host microorganism's ability to produce 
large quantities of the desired metalloproteinase inhibitor. 

In addition, the cloning vectors* within the scope of the 
present invention may contain supplemental nucleotide sequences 
preceding or subsequent to the portable DNA sequence. These sup- 
plemental sequences are those that will not interfere with tran^ 
scription of the portable DNA sequence and will, in some in- 
M 8 stances as set forth more fully hereinbelow, enhance 

*S transcription, translation, or the ability of the primary amino 

& 

u acid structure of the resultant metalloproteinase inhibitor to 

assume an active, tertiary form. 

A preferred vector of the present invention is set forth in 
Figure 1. This vector, pUC9-F5/237P10 , contains the preferred 
nucleotide sequence set forth above. Vector pUC9-F5/237P10 is - 
present in the C600/pUC9-F5/237P10 cells on deposit in the 
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: American Type Culture Collection in Rockville, Maryland under 

, I 

; Accession No, 53003. 

1 1 

;! A preferred nucleotide sequence encoding the metallopro- 

ii 

teinase inhibitor is identified in Figure 1 as region A. Plasmid 
pUC9-F5/237PlO also contains supplemental nucleotide sequences 
preceding and subsequent to the preferred portable DNA sequence 
in region A. These supplemental sequences are identified as re- 
gions B and C f respectively. 

In alternate preferred embodiments, either one or both of 
the preceding or subsequent supplemental sequences may be removed 
from the vector of Fig. 1 by treatment of the vector with re- 
ft strict ion endonucleases appropriate for removal of the supplemen- 



tal sequences. The supplemental sequence subsequent to the por- 
table DNA sequence, identified in Fig. 1 as region C, may be 
removed by treatment of the vector with a suitable restriction 
f. endonuclease, preferably Hq i AI followed by reconstruction of the 

M j 3' end of region A using synthetic oligonucleotides and ligation 



J! of the vector with T-4 DNA ligase. Deletion of the supplemental 



U sequence preceding the portable DNA sequence, identified as re- 

gion B in Fig. 1, would be specifically accomplished by the meth- 
od set forth in Example 2. 

In preferred embodiments, cloning vectors containing and ca- 
pable of expressing the portable DNA sequence of the present 
invention contain various operational elements. These 
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|| "ope^lional elements," as discuss erein, include at least one 
promoter, at least one Shine-Dalgarno sequence, at least one 
terminator codon. Preferably, these "operational elements" also 
! include at least one operator, at least one leader sequence, and 
! for proteins to be exported from intracellular space, at least 

one regulator and any other DNA sequences necessary or preferred 
' for appropriate transcription and subsequent translation of the 
I vector DNA. 

Additional embodiments of the present invention are envi- 
sioned as employing other known or currently undiscovered vectors 
which would contain one or more of the portable DNA sequences de- 
scribed herein. In particular, it is preferred that these 
§ vectors have some or all of the following characteristics: (1) 

| possess a minimal number of host-organism sequences; (2) be sta- 

ble in the desired host; (3) be capable of being present in a 
| high copy number in the desired host; (4) possess a regulatable 

promoter; (5) have at least one DNA sequence coding for a se- 
lectable trait present on a portion of the plasmid separate from 
that where the portable DNA sequence will be inserted; and (6) b. 
integrated into the vector. 

The following, noninclus ive , list of cloning vectors is be- 
lieved to set forth vectors which can easily be altered to meet 
the above-criteria and are therefore preferred for use in the 
present invention. Such alterations are easily performed by 
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those of ordinary skill in th art in light of the available lit- 
erature and the teachings herein. 



TABLE I 



HOST 



E. coli 



BACILLUS 

B. subtil is 

B. amyloliouef aciens 

B. stearothermophilus 



PSEUDOMONAS 
P. aeruginosa 
P. putida 



CLOSTRIDIUM 
C. perfrinqens 



SACCHAROMYCES 
S. cerevisiae 



Vectors 

pUC8 

pUC9 

pBR322 

pGW7 

placid 

pDP8 

pUBHO 

pSAOSOl 

pSA2100 

pBD6 

pBD8 

PT127 

RSF1010 
Rmsl49 
pKT209 
RK2 

pSa727 

pJU12 

pJU7 

pJUlO 

pJU16 

pJU13 

YEp24 

YIp5 

YRpl7 



Comments 

Many selectable replicons 
have been characterized. 
Maniatis, T. et al. (1982), 
Molecular Cloning: A 
Laboratory Manual . Cold 
Spring Harbor Laboratory. 

Genetics and Biotechnology 
of Bacilli , Ganesan and 

Academic 



Hoch, eds., 
Press. 



1984, 



Some vectors useful in 
broad host range of gram- 
negative bacteria including 
Xanthomonas and Agrobacter i\. 



Shuttle plasmids for E. 
coli and C. perfrinqens 
construction ref. Squires, 
C. et al. (1984) Journal 
Bacteriol. 159 ;465-471. 
Botstein and Davis in 
Molecular Biology of the 
Yeast Saccharomyces , 
Strathern, Jones, and 
Broach, eds., 1982, Cold 
Spring Harbor Laboratory. 



It is to be understood that additional cloning vectors may now 
exist or will be discovered which have the above- ident i f ied prop- 
erties and are therefore suitable for use in the present 
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invention. These vectors are also contemplated as being -iLhin 
the scop of the disclosed series of cloning vectors into which 
the portable DNA sequences may be introduced, along with any nec- 
essary operational elements, and which altered vector is then in- 
cluded within the scope of the present invention and would be ca- 
pable of being used in the recombinant-DNA method set forth more 
fully below. 

In addition to the above list, an E. coli vector system, as 
; set forth in Example 2, is preferred in one embodiment as a 
cloning vector. Moreover, several vector plasmids which autono- 
mously replicate in a broad range of Gram Negative bacteria are 
preferred for use as cloning vehicles in hosts of the genera 
5 Pseudomonas . These are described by Tait, R.C. , Close, T.J., 

Lundquist, R.C. , Hagiya, M., Rodriguez, R.L., and Kado, C.I- in 
Biotechnology, May, 1983, pp. 269-275; Panopoulos, N.J. in 
Genetic Engineering in the Plant Sciences , Praeger Publishers, 
New York, New York, pp. 163-185, (1981); and Sakaguchi, K. in 
M Current Topic in Microbiology and Immunology 96:31-45, (1982), 

P each of which is specifically incorporated herein by reference. 

One particularly preferred construction employs the plasmid 
RSF1010 and derivatives thereof as described by Bagdasarian, M., 
Bagdasarian, M.M. , Coleman, S., and Timmis, K.N. in Plasmids of 
Medical Environmental and Commercial Importance , Timmis, K.N. and 
Puhler, A. eds., Elsevier/North Holland Biomedical Press, (1979), 
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!i sp cif ically incorporated herein by reference. The advantages of 
RSFlOrO are that it is relatively small, high copy number plasmid 
which is readily transformed into and stably maintained in both 
E. coli and Pseudomonas species. In this system, it is preferred 
! to use the Tac expression system as described for Escherichia. 
; since it appears that the E. coli trp promoter is readily recog- 
nized by Pseudomonas RNA polymerase as set forth by Sakaguchi, K. 
; in Current Topics in Microbiology and Immunology 96:31-45 (1982) 
and Gray, G.L., McKeown, K.A. , Jones, A.J.S., Seeburg, P.H., and 
Heyneker, H.L. in Biotechnology Feb. 1984, pp. 161-165, both of 
which are specifically incorporated herein by reference. Tran- 
scriptional activity may be further maximized by requiring the 
exchange of the promoter with, e.g., an E. coli on P. aeruginosa 
trp promoter. 

In a preferred embodiment, P. aeruginosa is transformed with 
vectors directing the synthesis of the metalloproteinase inhib- 
itor as either an intracellular product or as a product coupled 
j to leader sequences that will effect its processing and export 

from the cell. In this embodiment, these leader sequences are 
preferably selected from the group consisting of beta-lactamase , 
OmpA protein, the naturally occurring human signal peptide, and 
that of carboxyeptidase G2 from Pseudomonas . Translation may be 
coupled to translation initiation for any of the E. coli proteins 
as described in Example 2, as well as to initiation sites for any 
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j: of the highly expressed proteins of the host to cause in- 

i i 

' tracellular expression of the metalloproteinase inhibitor. 

In those cases where restriction minus strains of a host 
.1 Pseudomonas species are not available, transformation efficiency 
: with plasmid constructs isolated from E. coli are poor. There- 
]\ fore, passage of the Pseudomonas cloning vector through an r- m+ 
.: strain of another species prior to transformation of the desired 
:!host, as set forth in Bagdasarian, M. , et al., Plasmids of 
Medical, Environmental and Commercial Importance , pp. 411-422, 

1 i 
I 

;; Timnus . and Puhler eds., Elsevier/North Holland Biomedical Press 

!(1979), specifically incorporated herein by reference, is de- 

0 sired. 

Furthermore, a preferred expression system in hosts of the 

Jff genera Baci llus involves using plasmid pUBHO as the cloning ve- 

Jj| hide. As in other host vector systems, it is possible in 

N Baci llus to express the metalloproteinase inhibitors of the pres- 

et 

M ent invention as either an intracellular or a secreted protein. 

Ms 

U The present embodiments include both systems. Shuttle vectors 

ij that replicate in both Bac i llus and E. coli are available for 

M? 

constructing and testing various genes as described by Dubnau, 
D., Gryczan, T., Contente, S., and Shivakumar, A.G. in Genetic 
Engineering , Vol. 2, Setlow and Hollander eds., Plenum Press, New 
York, New York, pp. 115-131, (1980), specifically incorporated 
herein by reference. For the expression and secretion of 
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metalloproteinase inhibitors from B. subtil is , the signal se- 
|j quence* of alpha-amylase is preferably coupled to the coding re- 
ii gion for the metalloproteinase inhibitor. For synthesis of 
! intracellular metalloproteinase inhibitor, the portable DNA se- 
quence will be translat ionally coupled to the ribosome binding 
site of the alpha-amylase leader sequence. 

Transcription of either of these constructs is preferably 
; directed by the alpha-amylase promoter or a derivative thereof. 
This derivative contains the RNA polymerase recognition sequence 
of the native alpha-amylase promoter but incorporates the lac op- 
erator region as well. Similar hybrid promoters constructed from 
the penicillinase gene promoter and the lac operator have been 
shown to function in Bacillus hosts in a regulatable fashion as 
set forth by Yansura, D.G. and Henner in Genetics and Biotechnol- 
ogy of Bac i 11 i , Ganesan, A.T. and Hoch, J. A., eds., Academic 
Press, pp. 249-263, (1984), specifically incorporated by refer- 
ence. The lad gene of lacl^ would also be included to effect 
regulat ion . 

One preferred construction for expression in Clostridium is 
in plasmid pJU12 described by Squires, C. H. et al in J. 
Bacteriol. 159:465-471 (1984), specifically incorporated herein 
by reference, transformed into C. perfrinqens by the method of 
Heefner, D. L. et al. as described in J. Bacteriol. 159:460-464 
(1984), specifically incorporated herein by reference. 
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Transcription is directed by the promoter of the tetracycline re- 
sistance gene. Translation is coupled to the Shine-Dalgarno se- 
|j qu nces of this same tet r gene in a manner strictly analogous to 
ij the procedures outlined above for vectors suitable for use in 
' other hosts. 

i: Maintenance of foreign DNA introduced into yeast can be ef- 

fected in several ways (Botstein, D., and Davis, R. W., in The 
;j Molecular Biology of the Yeast Saccharomvces . Cold Spring Harbor 

: l 

'Laboratory, Strathern r Jones and Broach r eds., pp. 607-636 

■ j 

(1982). One preferred expression system for use with host organ- 
isms of the genus Saccharomyces harbors the ant icollagenase gene 
on the 2 micron plasmid. The advantages of the 2 micron circle 
include relatively high copy number and stability when introduced 
into cir° strains. These vectors perferably incorporate the rep- 
lication origin and at least one antibiotic resistance marker 
from pBR322 to allow replication and selection in E. coli . In 



h ik addition, the plasmid will preferably have 2 micron sequences and 

the yeast LEU 2 gene to serve the same purposes in LEU2 mutants of 



y : yeast. 

The regulatable promoter from the yeast GAL1 gene will pref- 
erably be adapted to direct transcription of the portable DNA se- 
quence gene. Translation of the portable DNA sequence in yeast 
will be coupled to the leader sequence that directs the secretion 
of yeast alpha-factor. This will cause formation of a fusion 
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prwein which will be processed i^feast and result in secretion 
•i of a metalloproteinase inhibitor. Alternatively, a methionyl- 
metalloproteinase inhibitor will be translated for inclusion 

I i 
! i 

: within the cell. 

It is anticipated that translation of mRNA coding for the 
metalloproteinase inhibitor in yeast will be more efficient with 
the preferred codon usage of yeast than with the sequence presen 
in pUC8-Fic, as identified in Example 2, which has been tailored 
to the prokaryotic bias. For this reason, the portion of the 5' 
end of the portable DNA sequence beginning, at the Tthllll site i 
preferably resynthesized. The new sequence favors the codons 
most frequently used in yeast. This new sequence preferably has 
the following nucleotide sequence: 



5' 



HgiA) 
GAT CCG TGC 



CT TQT GTT CCA CCA CAC 
GC ACG TGA ACA CAA GGT GGT GTG 



CCA CAA ACT GCT VrC TGT AAC TCT GAC . C 
GGT GTT TGA CGA AAG ACA TTG AGA CTG GA 3' 
As will be seen from an examination of the individual 
cloning vectors and systems contained on the above li-z and de- 
scription, various operational elements may be present in each c 
the preferred vectors of the present invention. It is contem- 
plated any additional operational elements which may be requirec 
may be added to these vectors using methods known to those of 
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ordinary skill in the art, particularly in light of the teachings 
herein*." 

In practice, it is possible to construct each of these 
vectors in a way that allows them to be easily isolated, assem- 
ijbled, and interchanged. This facilitates assembly of numerous 
!j functional genes from combinations of these elements and the 
!j coding region of the metalloproteinase inhibitor. Further, many 
•of these elements will be applicable in more than one host. 
'] At least one origin of replication recognized by the contem- 

plated host microorganism, along with at least one selectable 
■marker and at least one promoter sequence capable of initiating 
.transcription of the portable DNA sequence are contemplated as 
being included in these vectors. It is additionally contemplated 
that the vectors, in certain preferred embodiments, will contain 
III DNA sequences capable of functioning as regulators ("operators"), 

and other DNA sequences capable of coding for regulator proteins. 
In preferred vectors of this series, the vectors additionally 
contain ribosome binding sites, transcription terminators and 
leader sequences. 

These regulators, in one embodiment, will serve to prevent 
•expression of the portable DNA sequence in the presence of cer- 
tain environmental conditions and, in the presence of other envi- 
ronmental conditions, allow transcription and subsequent expres- 
sion of the protein coded for by the portable DNA sequence. In 
r 
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particular, it 4c preferred that regulatory segments be inserted 
j into the vector such that expression of the portable DNA sequence 
will not occur in the absence of, for example, isopro- 
pylthio-P* -d-galactoside. In this situation, the transformed mi- 
croorganisms containing the portable DNA may be grown to a de- 
sired density prior to initiation of the expression of the 
metalloproteinase inhibitor. In this embodiment, expression of 
the desired protease inhibitor is induced by addition of a sub- 
stance to the microbial environment capable of causing expression 
of the DNA sequence after the desired density has been achieved. 
Additionally, it is preferred that an appropriate secretory 
;3j leader sequence be present, either in the vector or at th 5' end 

of the portable DNA sequence, the leader sequence being in a 
position which allows the leader sequence to be immediately adja- 



til 

Ijlj cent to the initial portion of the nucleotide sequence capable of 



directing expression of the protease inhibitor without any inter- 
vening transcription or translation termination signals. The 
presence of the leader sequence is desired in part for one or 
more of the following reasons: 1) the presence of the leader se- 
quence may facilitate host processing of the initial product to 
the mature recombinant metalloproteinase inhibitor; 2) the pres- 
ence of the leader sequence may facilitate purification of the 
recombinant metalloproteinase inhibitors, through directing the 
metalloproteinase inhibitor out of the cell cytoplasm; 3) the 
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presence of the leader seauence may affect the ability of the 
recombinant metalloproteinase inhibitor to fold to its active 
structure through directing the metalloproteinase inhibitor out 
of the cell cytoplasm. 

In particular, the leader sequence may direct cleavage of 
the initial translation product by a leader peptidase to remove 
the leader sequence and leave a polypeptide with the amino acid 
sequence which has the potential of metalloproteinase inhibitory 
activity. In some species of host microorganisms, the presence 
of the appropriate leader sequence will allow transport of the 
completed protein into the periplasmic space, as in the case of 
E. coli . In the case of certain yeasts and strains of Bacillus 
and Pseudomonas , the appropriate leader sequence will allow 
transport of the protein through the cell membrane and into the 
extracellular medium. In this situation, the protein may be 
purified from extracellular protein. 

Thirdly, in the case of some of the metalloproteinase inhib- 
itors prepared by the present invention, the presence of the 
leader sequence may be necessary to locate the completed protein 
in an environment where it may fold to assume its active struc- 
ture, which structure possesses the appropriate metalloproteinase 
activity. 

Additional operational elements include, but are not limited 
to, ribosome-binding sites and other DNA sequences necessary for 



law orriccs 

ftnnecan. henderson 
Farabow. Garrett 

& DUNNER 
i»ts * stucct.h.w. 
wA»MtNQTo*«. o. c.aoooe 

I202i29J-e«ftO 



-31- 



microbial expression of foreign p^teins. The operational ele- 

lj 

jments as discussed herein can be routinely selected by those of 

ji 

^ordinary skill in the art in light of prior literature and the 
i teachings contained herein. General examples of these operation- 
al elements are set forth in B. Levin, Genes , Wiley & Sons, New 
:York (1983), which is specifically incorporated herein by refer- 
ence. Various examples of suitable operational elements may be 
| found on the vectors discussed above and may be elucidated 

. i 

"through review of the publications discussing the basic charac- 
teristics of the aforementioned vectors. 

In one preferred embodiment of the present invention, an 
additional DNA sequence is located immediately preceding the por- 
table DNA sequence which codes for the metalloproteinase inhib- 
itor. The additional DNA sequence is capable of functioning as a 
translat ional coupler, i.e., it is a DNA sequence that encodes an 
RNA which serves to position ribosomes immediately adjacent to 
the ribosome binding site of the metalloproteinase inhibitor RNA 
with which it is contiguous. 

Upon synthesis and/or isolation of all necessary and desired 
component parts of the above-discussed cloning vectors, the 
vectors are assembled by methods generally known to those of or- 
dinary skill in the art. Assembly of such vectors is believed to 
be within the duties and tasks performed by those with ordinary 
skill in the art and, as such, is capable of being performed 
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without undue experimentation. For examp)*- similar DNA se- 
quences have been ligated into appropriate cloning vectors, as 
set forth in Schoner et al . , Proceedings of th National Academy 
of Sciences U.S.A. , 81:5403-5407 (1984), which is specifically 
j incorporated herein by reference. 

f j In construction of the cloning vectors of the present inven- 

tion, it should additionally be noted that multiple copies of the 
^portable DNA sequence and its attendant operational elements may 

li 

:be inserted into each vector. In such an embodiment, the host 

j! 

^organism would produce greater amounts per vector of the desired 
,: metalloproteinase inhibitor. The number of multiple copies of 
^ the DNA sequence which may be inserted into the vector is limited 

only by the ability of the resultant vector, due to its size, to 
be transferred into and replicated and transcribed in an appro- 
priate host microorganism. 
4 Additionally, it is preferred that the cloning vector con- 

tain a selectable marker, such as a drug resistance marker or 
other marker which causes expression of a selectable trait by the 



host microorganism. In a particularly preferred embodiment of 

& ...... 

the present invention, the gene for ampicillin resistance is in- 
deluded in vector pUC9-F5/237P10 . 

J Such a drug resistance or other selectable marker is intend- 

ed in part to facilitate in the selection of t ransf ormants . 
Additionally, the presence of such a selectable marker on the 
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cloning vector may be of use in keeping contaminating microorga- 

i I 

j nisms -from multiplying in the culture medium. In chis embodi- 



ijment, such a pure culture of the transformed host microorganisms 

ji 

;| would be obtained by culturing the microorganisms under condi- 
tions which require the induced phenotype for survival, 
i; It is noted that, in preferred embodiment, it is also desir- 

i able to reconstruct the 3' end of the coding region to allow 

! 

;i assembly with 3 f non-translated sequences. Included among these 

'I 

; non-translated sequences are those which stabilize the mRNA or 
enhance its transcription and those that provide strong tran- 
scriptional termination signals which may stabilize the vector as 
they are identified by Gentz, R. , Langner, A., Chang, A.C.Y., 

.Cohen, S.H., and Bujard, H. in Proc. Natl, Acad. Sci. USA 
78:4936-4940 (1981), specifically incorporated herein by refer- 
ence. 

This invention also relates to a recombinant-DNA method for 
the production of metallproteinase inhibitors. Generally, this 
method includes: 

(a) preparation of a portable DNA sequence capable of 
directing a host microorganism to produce a protein 
having metalloproteinase inhibitor activity; 

(b) cloning the portable DNA sequence into a vector capable 
of being transferred into and replicating in a host mi- 
croorganism, such vector containing operational 
elements for the portable DNA sequence; 
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(c) transferring the vector containing the portable Dii A se- 
quence and operational elements into a host microorga- 
nism capable of expressing the metalloproteinase inhib- 
itor protein; 

(d) culturing the host microorganism under conditions 
appropriate for amplification of the vector and expres- 
sion of the inhibitor; and 

: j (e) in either order: 

; (i) harvesting the inhibitor; and 

\\ (ii) causing the inhibitor to assume an active, teiti- 

'! ary structure whereby it possesses metallopro- 

j 

teinase inhibitor activity. 
» yTiv'N In this method, the portable DNA sequences are those syn- 
v thetic or naturally-occurring polynucleotides described above. 

In a preferred embodiment of the present method, the portable DWA 
sequence has the nu/leotide sequence as follows: 

10 20 30 40 50 60 

GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT GTGTCCCACC CCACCCACAG 

70 80 90 100 110 120 

ACGGCCTTCT GCAATTCCGA CCTCGTCATC AGGGCCAAGT TCGTGGGGAC ACCAGAAGTC 

130 140 150 160 170 180 

AACCAGACCA CCTTATACCA GCGTTATGAG ATCAAGATGA CCAAGATGTA TAAAGGGTTC 

190 200 210 220 230 240 

CAAGCCTTAG GGGATGCCGC TGACATCCGG TTCGTCTACA CCCCCGCCAT GGAGAGTGTC 

250 260 270 280 290 300 

TGCGGATACT TCCACAGGTC CCACAACCGC AGCGAGGAGT TTCTCATTGC TGGAAAACTG 
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~ 310 320 330 ~ 340 350 360 

' CAGGATGGAC TCTTGCACAT CACTACCTGC AGTTTCGTGG CTCCCTGGAA CAGCCTGAGC 

. 370 380 390 400 410 420 

TTAGCTCAGC GCCGGGGCTT CACCAAGACC TACACTGTTG GCTGTGAGGA ATGCACAGTG 

430 440 450 460 470 480 

! TTTCCCTGTT TATCCATCCC CTGCAAACTG CAGAGTGGCA CTCATTGCTT GTGGACGGAC 

;i 490 500 510 520 530 540 

! CAGCTCCTCC AAGGCTCTGA AAAGGGCTTC CAGTCCCGTC ACCTTGCCTG CCTGCCTCGG 

550 560 570 580 590 600 

GAGCCAGGGC TGTGCACCTG GCAGTCCCTG CGGTCCCAGA TAGCCTGAAT CCTGCCCGGA 

1 610 620 630 640 650 660 

1 GTGGAAGCTG AAGCCTGCAC AGTGTCCACC CTGTTCCCAC TCCCATCTTT CTTCCGGACA 

670 680 -690 700 

Jatgaaataaa GAGTTACCAC CCAGCAAAAA AAAAAAGGAA TTC 

The vectors contemplated as being useful in the present 

method are those described above. In a preferred embodiment, the 



VM cloning vector pUC9-F5/237P10 is used in the disclosed method. 



The vector thus obtained is then transferred into the appro- 
l!| priate host microorganism. It is believed that any microorganism 

having the ability to take up exogenous DNA and express those 
genes and attendant operational elements may be chosen. It is 
preferred that the host microorganism be an anaerobe, facultative 
anaerobe or aerobe. Particular hosts which may be preferable for 
use in this method include yeasts and bacteria. Specific yeasts 
include those of the genus Saccharomvces , and especially 
Saccharomvces cerevisiae . 

Specific bacteria include those of the genera Bacillus and 
Escherichia and Pseudomonas . Various other preferred hosts are 
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set forth in Table I, supra . In other, alternatively preferred 
embodim nts of the present invention, Bacillus subt ilis . 
jl Escherichia coli or Pseudomonas aeruginosa are selected as the 
| host microorganisms. 

j After a host organism has been chosen, the vector is trans- 

| ferred into the host organism using methods generally known by 
; those of ordinary skill in the art. Examples of such methods may 

i 

'be found in Advanced Bacterial Genetics by R. W. Davis et al . , 
'cold Spring Harbor Press, Cold Spring Harbor, New York, (1980), 
;which is specifically incorporated herein by reference. It is 
preferred, in one embodiment, that the transformation occur at 
pi low temperatures, as temperature regulation is contemplated as a 

means of regulating gene expression through the use of operation- 
al elements as set forth above. In another embodiment, if 
osmolar regulators have been inserted into the vector, regulation 
of the salt concentrations during the transformation would be re- 
M quired to insure appropriate control of the synthetic genes. 

U. If it is contemplated that the recombinant metalloprote inase 

p inhibitors will ultimately be expressed in yeast, it is preferred 

that the cloning vector first be transferred into Escherichia 
coli, where the vector would be allowed to replicate and from 
which the vector would be obtained and purified after amplifica- 
tion. The vector would then be transferred into the yeast for 
ultimate expression of the metalloproteinase inhibitor. 
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The host microorganisms are cultured under conditions appro- 
priat€"'for the expression of th metalloproteinase inhibitor. 
These conditions are generally specific for the host organism, 
:|and are readily determined by one of ordinary skill in the art, 
!in light of the published literature regarding the growth condi- 
tions for such organisms, for example Berqev's Manual of 

Determinative Bacteriology , 8th Ed., Williams & Wilkins Company, 
^Baltimore, Maryland, which is specifically incorporated herein by 

! ! 

reference. 

* i 

- j 

Any conditions necessary for the regulation of the expres- 
sion of the DNA sequence, dependent upon any operational elements 
inserted into or present in the vector, would be in effect at the 
transformation and culturing stages. In one embodiment, the 
cells are grown to a high density in the presence of appropriate 
regulatory conditions which inhibit the expression of the DNA se- 
quence. When optimal cell density is approached, the environ- 
mental conditions are altered to those appropriate for expression 
of the portable DNA sequence. It is thus contemplated that the 
production of the metalloproteinase inhibitor will occur in a 
time span subsequent to the 'growth of the host cells to near 
optimal density, and that the resultant metalloproteinase inhib- 
itor will be harvested at some time after the regulatory condi- 
tions necessary for its expression were induced. 
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In a preferred embodiment or^Re present invention, the 
recombinant metalloproteinase inhibitor is purified subsequent tc 
harvesting and prior to assumption of its active structure. This 
embodiment is preferred as th inventors believe that recovery ot 
i a high yield of re-folded protein is facilitated if the protein 
is first purified. However, in one preferred, alternate embodi- 
ment, the metalloproteinase inhibitor may be allowed re-fold to 
jl 

;! assume its active structure prior to purification. In yet 

h 

, ! another preferred, alternate embodiment, the metalloproteinase 

: i 

'} inhibitor is caused to assume its re-folded, active state upon 
; i recovery from the culturing medium. 

i In certain circumstances, the metalloproteinase inhibitor 

will assume its proper, active structure upon expression in the 
host microorganism and transport /oi the protein through the cell 
wall or membrane or into the pe^iplasmic space. This will gener 
ally occur if DNA coding f or yan appropriate leader sequence has 
been linked to the DNA coding for the recombinant protein. The 
preferred metalloprbtiena/e inhibitors of the present invention 
will assume their matur/, active form upon translocation out of 
the inner cell membranfe. The structures of numerous signal 
peptides have been published, for example by Marion E.E. Watson 
in Nuc. Acid Res. tlx 515-5164 , 1984, specifically incorporated 
Iherein by reference. It is intended that these leader sequences, 
together with portable DNA, will direct intracellular production 
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of^fcusion proteiny^Tch will be^fcnsported through the cell 
i! membrane and will/have the leader sequence portion cleaved upon 
release from tKe cell. 

1 

I In a preferred embodiment, the signal peptide of the E.coli 

'ompA protein is used as a leader sequence and is located in a 

r 

; position contiguous with the portable DNA sequence coding for the 
' metalloproteinese inhibitor structure. 

I Additionally preferred leader sequences include those of 

1 

' beta-lactamase, carboxypeptidase G2 and the human signal protein. 

i 

I These and other leader sequences are described. 

! If the metalloproteinase inhibitor does not assume its prop- 

er, active structure, any disulfide bonds which have formed 
and/or any noncovalent interactions which have occurred will 
;j first be disrupted by denaturing and reducing agents, for exam- 

jj pie, guanidinium chloride and j3-mercaptoethanol , before the 

J metalloproteinase inhibitor is allowed to assume its active 

structure following dilution and oxidation of these agents under 

M= controlled conditions. 

The transcription terminators contemplated herein serve to 
stabilize the vector. In particular, those sequences as de- 
scribed by Gentz et al. , in Proc. Natl. Acad. Sci. USA 28: 
4936-4940 (1981), specifically incorporated herein by reference, 
are contemplated for use in the present invention. 
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is to be understood that a(Jication of the teachings of 
the present invention to a specific problem or environment will 
be within the capabilities of one having ordinary skill in the 
art in light of the teachings contained herein. Examples of the 
products of the present invention and representative processes 

lifor their isolation and manufacture appear in the following exam- 

!j 

pies. 

:! EXAMPLES 

:i 

i EXAMPLE 1 

^ : j r L^ : ^^ o^PoWTA^ **S~*rom JffiFVSA^iWaWs^ 
o° '■ I HEF-3A cells were grown to near confluence in 75 cm 2 

^T-flasks. Cells were washed twice in Dulbecco's phosphat buff- 
ered saline solution and harvested by the addition of 2 ml of 10 
! mM Tris, pH 7.5 containing 1% v/v SDS (obtained from BDH chemi- 
cals, Ltd., Poole, England), 5 mM EDTA and 20 ug/ml protease K 
(obtained from Boehringer Mannheim Biochemicals , Indianapolis, 
Indiana). Each flask was subsequently washed with an additional 
M. milliliter of this same solution. 

The pooled aliquots from the cell harvest were made to 70 
ug/ml in protease K and incubated at 40'C for 45 minutes. The 
proteolyzed solution was brought to a NaCl concentration of 150 
mK by the addition of 5 M stock and subsequently extracted with 
ian equal volume of phenol : chloroform 1:1. The aqueous phase was 
reextracted with an equal volume of chloroform. Two volumes of 
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ethahol were added to the aqueous phase and incubated overnight 

i I 

j! at -20°C. The precipitated nucleic acids w r recovered by 

i i 

'I ■ - 

j centrif ugation at 17 , 500 xg for 10 minutes in a Beckman J2-21 
! ! centrifuge, Beckman Instruments, Palo Alto, California, and were 

|i redissolved in 25 ml of 0.1% w/v SDS. This solution was again 

'. i 

'extracted with an equal volume of chloroform. The aqueous phase 
was added to two volumes of cold ethanol and kept at -20 °C for 2 
hours. The precipitate was collected by centrif ugation at 10,000 
' xg for 15 minutes and redissolved in 10 ml of 1 mM Tris, 0.5 mM 
EDTA, 0.1% SDS, pH 7.5. RNA was precipitated from this solution 
by the addition of 10 ml of 4 M LiCl, 20 mM NaoAc, pH 5.0 and in- 
tubated at -20°C for 18 hours. The precipitate was again recov- 
ered by centrif ugation and. washed twice with 2 M LiCl before 



Vj redissolving in 1 mM Tris, 0.5 mM EDTA, 0.1% SDS, pH 7.5. This 



solution was stored at -70°C. 
^ Chromatography on Oliqo dT Cellulose 

hh Total cellular RNA prepared as above was ethanol precipi- 

tate 

tated and redissolved in 0.5 M NaCl. Five ml of RNA at 0.45 
mg/ml were applied to a 1 ml column of washed type VII oligo dT 
cellulose (obtained from PL Biochemicals , Milwaukee, Wisconsin). 
The column was then washed with 10 ml of 0.5 M NaCl and eluted 
with 2.0 ml of sterile H2O. The eluted poly(A*) fraction of RNA 
was ethanol precipitated and dissolved to give a 1 mg/ml solution 
in 1 mM Tris, 0.1 mM EDTA, pH 8.0. This was stored at -70°C. 
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cDNA Synthesis 

Paly(A + ) RNA was primed with oligo dT (obtained from PL 
I Biochemicals, Milwaukee, Wisconsin) to serve as a template for 

cDNA synthesis by AMV reverse transcriptase (obtained from Life 
^Sciences, Inc., St. Petersburg, Florida). Following the synthe- 
sis reaction, the RNA was hydrolyzed by the addition of 0.1 vol- 
ume of 3 N NaOH and incubated at 67°C for 10 minutes. The solu- 

iltion was then neutralized and the cDKA purified by gel filtration 

ji 

:| chromatography on biogel A 1.5 (obtained from BioRad La- 
It 

i:boratories, Richmond, California) in a 0.7x25 cm column in a 10 

i i 

iimM Tris, 5 mM EDTA, and 1% SDS, pH 7.5 solution. Fractions con- 
'taining cDNA were pooled and concentrated by ethanol precipita- 
tion. The cDNA was dG tailed and purified by gel filtration 
using the procedure set forth above. Second strand synthesis was 
primed with oligo dC and polymerized in an initial reaction with 
the large (Klenow) fragment of DNA polymerase (obtained from 
Boehringer Mannheim). Following second strand synthesis, E. coH 
DNA polymerase I (obtained from Boehringer Mannheim) was added 
and incubation continued to form blunt ends. The double stranded 
cDNA was again purified by chromatography. EcoRI restriction 
: sites within the cDNAs were modified by the action of EcoRI 
''methylase, obtained from New England Biolabs, Beverly, Mas- 
sachusetts. The cDNA was again purified and ligated to synthetic 
EcoRI linkers. Finally, the ends were then trimmed with the 
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endonuclease and the cDNA purified by gel filtration. This DNA 
was ligated into a unique Eco RI site in lambda gtlO DNA packaged 
in vitro and used to infect E. coli strain hf 1A according to the 
method set forth by Huynh, T.V., Young, R. A. , and Davis, R.W. , in 
ji DNA Cloning Techniques , A Practical Approach (ed. Glover, D.M.) 



(IRL Press Oxford), in press , specifically incorporated herein by 
reference. Approximately 25,000 recombinants were amplified in 
this manner. 
Screening 

ij Recombinant-phage-containing sequences of interest were se- 

| j 

ijlected by their preferential hybridization to synthetic oligo- 
nucleotides encoding portions of the primary structure of the de- 

:| 

■ ! sired metalloproteinase inhibitor, hereinafter referred to as 
"FIBAC. These portions of the protein sequence correspond in part 

i 

to those set forth in the published literature by Stricklin, G.P. 
and Welgus, H.G., J.. Biol. Chem. 258: 12252-12258 (1983), specif- 
ically incorporated herein by reference. Recombinant phage were 
used to infect E . coli strain hflA and plated at a density of 
approximately 2xl0 3 pfu/150 mm petri dish. Phage were blotted 
onto nitrocellulose f i Iters .( BA85 , Schleicher & Schuell Inc., 
ijKeene, New Hampshire), and DNA was denatured and fixed essential- 
ly as described by Benton and Davis in Science 196 : 180-182 (1979) 



Ispecif ically incorporated herein by reference, 
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^sing that procedure, the filA were treated sequentially 
for 10-15 minutes each in 0.5 M NaCl, then 1.0. M Tris, 1.5 M NaCl 
pH 8.0,' and finally submerged in 2x SSPE. (2x SSPS is 0.36 M 
NaCl, 20 mM NaH 2 P0 4 , 2 mM EDTA pH 7.4). Filters were blotted dry 
and baked 75°-80° for 3-4 hours. Duplicate filters were made of 
'each plate. Filters were prehybridized for 1-3 hours at 37" in 
isx SSPE containing O.lx SET, 0.15% NaPPi, and lx Denhardts solu- 
tions. Filters were then hybridized for 72 hours at 37° in this 
'same solution containing 5xl0 5 cpm/ml of 5' end-labeled 51-mer 
Oligonucleotide specific activity approximately 10 6 cpm/pmole). 
"Following hybridization, filters were washed six times in 5x SSPE 
containing O.lx SET and 0.05% sodium pyrophosphate at 37°, then 
three times in 2x SSPE at 21°. These were then blotted dry and 
autoradiographed on Kodak XAR-5 film at -70° with a Kodak 
lightening-plus intensifying screen. Signals clearly visible 
from duplicate filters were used to pick phage for plaque purifi- 
cation. Filter preparations and hybridization procedures for 
plaque purification steps were the same as above. The washing 
procedure was simplified to 6 changes of 2x SSPE at 37*. Six 
isolates purified by repetitive plating were then arranged on a 
single lawn of E . coli strain C600 for testing with subsequent 
probes . 

' Preferential hybridization of the 17-mer to each of the iso- 
lates (as opposed to control plaques) was observed under a 
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coition identical to that used ^plaque purif ica-m. Probe 
lc was used in a similar test, except that the SSPE concentration 
'during hybridization was reduced to 4x. Again, each of the iso- 
lates demonstrated stronger hybridization to the probe than did 

control plaques. 

'. Phaoe Purj fication an d cDNA Characterization 

Quantities of each of the six isolated phage were made by 
ithe plate stock technique and purified by serial CsCl block gra- 
dient centrifugation. DMA was extracted from these by dialysis 
against 50% formamide as described by Davis, R.W., Botstein, D., 
Ro th, J.R., in * M.nn.l for G enet ic Enginee ring- Advanced Bacte- 
rial Genetics , 1980, Cold Spring Harbor Laboratory, specifically 
incorporated herein by reference. DNA from each of the isolates 
was digested with EcoRI and the products were analyzed by agarose 
gel electrophoresis. The insert from one of the larger clones, 
lambda FIBAC 5, was found to lack internal sites for Sail, 
Hindi II, BamHI, and EcoRI. The cDNA insert was released from 
lambda FIBAC 5 DNA and the lambda arms digested by co-digesting 
with these four enzymes. The fragments were then ethanol- 
precipitated and ligated into the EcoRI site of plasmid P UC9 
without further purification. These plasmids were then used to 
transform E. coli strain JM83. Transf ormants were selected on 
ampicillin containing plates. Plasmids from several trans- 
formants were purified and characterized on the basis of the 
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EcoRI digestion products. One was selected which had an insert 
co-migrating with the insert from lambda FIBAC 5 on agarose gel 
electrophoresis. This plasmid has been named pUC9-F5/237PlO . 
Mapping and Subcloninq 

The insert in pUC9-F5/237P10 was mapped with respect to in- 
jjternal Pst I sites. Double digests with Eco RI and Pst demon- 
strated three internal PstI recognition sites. The entire insert 

: I 

Hand the component pieces were subcloned into M13 bacteriophage 

i i 

||mpl9 and mpl8, respectively. Sequencing of the pieces was per- 
; formed by the dideoxynucleot ide method described by Sanger et al . 
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'in Sanger, F., Nicklen, S., and Coulson, A.R., Proc. Natl. Acad. 
Sci. USA 74:5463-5467 (1977), specifically incorporated herein by 
reference. 

The sequence of the DNA insert from pUC9-F5/237P10 showed an 
open reading frame which encodes the primary structure of a ma- 
ture fibroblast collagenase inhibitor biologically equivalent to 
that isolable from human skin fibroblasts. The salient features 
of the sequence are: 

(1) The insert is flanked by Eco RI restriction 
sites and by G/C and A/T homopolymer ic tracts 
consistent with the cloning methodology; 

(2) The coding strand is presented in the 5 f to 
! 3* convention with poly C at the 5 1 end and 

poly A at the 3' end, again consistent with 
the techniques employed; 

(3) If the first G in the sequence GTTGTTG imme- 
diately adjacent to the 3' end of the poly C 
tract is considered as nucleotide 1, then an 
open reading frame is presented which encodes 
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the primary structure orthe matur human 
fibroblast collagenase inhibitor b ginning at 
nucleotide 34 and continuing through 
nucleotide 585; 

(4) Th termination codon TGA at nucleotides 586 
through 588 defines the carboxy terminus of 
the translation product which is the same as 
that of the mature protein; 

(5) Nucleotides 1 through 33 define an amino acid 
sequence which is not found in the primary 
structure of the processed protein, but which 
is probably a portion of a leader peptide 
characteristic of secreted proteins; 

(6) The three internal Pst I sites have as their 
first base nucleotides 298, 327, and 448; 

(7) There is a single recognition sequence for 
the restriction enzyme Tthllll beginning at 
nucleotide 78; and 

(8) There is a single recognition sequence for 

• the restriction endonuclease Ncol beginning 
at nucleotide 227. 

The sequence of nucleotides 1 through 703 and restriction site 
analysis are shown. 
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:! ACC 1 .(GTVWAC) 1 

j 

! ALU 1 (AGCT) 4 



I 



; AVA 1 (CQCGPG) 1 



• AVA 2 (GGRCC) 3 



BBV 1 (GCTGC) 1 
H BST Nl (CCRGG) 3 

w 

8 DDE 1 (CTNAG) 4 

M- 

fa?* 
M 

ECO Rl (GAATTC) 1 



FNU4H 1 (GCNGC) 2 



SITES FRAGMENTS FRAGMENTS ENDS 



214 


495 

"1 .7 J 




214 
4 x * 


709 




214 
t x ^ 


(30 2 ) 


1 

X 


£ ± *t 


358 

W <J 0 


J JO 


(50 5 ) 




7 5ft 

J 3 O 


363 


124 


(17.5) 


482 


606 


482 


X X j 


\ X o • o / 


363 

JO J 


Aft 7 


606 


103 


(14.5) 


606 


709 




c 


\ U . / / 


35ft 
J DO 


3 6 3 

J O J 


536 

J w 


5 36 

J JO 


(75 6 ) 


1 
X 


536 

3 J O 




i 'J 


( 7A A ) 


5 7 6 
J JO 


709 


257 


257 


(36.2) 


1 


257 


477 




(3i n i 

V J X . u / 


757 


4 7 7 


572 


137 


(19.3) 


572 


709 






(13 4 ) 


477 


577 


269 

O J 


4 AO 
*t *t u 


(62 1 ) 


269 


709 




769 

to? 


(37 9 ) 


1 

X 


269 


344 


344 


(48.5) 


1 


344 


544 

«s *t *t 


700 


( 2fl 2 ) 


344 


544 
j ** *t 


557 


1 52 

-L 3 £ 


(21 4 ) 


557 


709 






(18) 
V X • o / 


544 
w t *t 


557 


1 ft6 


7 A A 
J *t *t 


( 4fl 5 ) 


365 

J O J 


709 


355 


186 


(26.2) 


1 


186 


360 


1 6Q 


/ 7 3 ft * 


1 ft 6 
X o o 


355 

j j •? 


365 


5 


( 0.7) 


360 


365 




C 


( o 7 ) 


3 55 

0 3 3 


360 


£ Qft 


6 Qft 


( Qfl A ) 


1 
X 


69R 




X X 


(16) 


698 


709 


196 


440 


(62.1) 


269 


709 


269 


196 


(27.6) 


1 


196 




73 


(10.3) 


196 


269 



orrictl 

Finnecan. Henderson -4 9- 

Farabow. Garrett 5 
ft Dunner 

l?7» « STMCCT. M.W. 
WASHINOTON.0.C.2O00* 



fill 

m 

M 





# 


SITES 


FRAGMENTS 


FRAGMENTS 


END 


FOK 1 ^GGATG) 


4 
















192 


i / <t 


I jo . o ; 


435 


709 






204 


192 


(27.1) 


1 


192 






303 

J w J 


132 


(18.6) 




t JJ 






435 


99 


(14.0) 


204 


303 


i 








1 1./; 


192 


204 




1 
















368 


368 


(51.9) 


1 


368 


; 






341 


(48.1) 


368 


709 


: HAE 3 (GGCC) 


3 












i 




30 


616 


(86.9) 


93 


709 


i 






33 


( 4.7) 


in 

J u 


O J 






93 


30 


( 4.2) 


63 


93 


• 






JO 


( 4.2) 


1 


30 


. nVji AX \ IjnuLnL / 


i 


• 














552 


552 


(77.9) 


1 


552 








Id / 


( 22 . 1 J 


552 


709 


uu& i (time) 


1 

X 
















369 


369 


(52.0) 


1 


369 








340 


148.0; 


369 


709 




X 
















118 • 


591 


(83.4) 


118 


709 








118 


(16.6) 


1 


118 


HINF 1 (GANTC) 


2 


















308 


(43.4) 




308 

J \J \J 






587 


279 


(39.4) 


308 


587 








122 


(17.2) 


587 


709 


HPA 2 (CCGG) 


4 
















207 




\ Jl . b ; 


372 


596 






372- 


207 


(29.2) 


1 


207 






J j D 


165 


(23.3) 


207 


372 






654 


58 


( 8.2) 


596 


654 








55 


( 7.8) 


654 


709 


HPH 1 (GGTGA) 


2 
















380 


380 


(53.6) 


1 


380 






519 


190 


(26.8) 


519 


709 








139 


(19.6) 


380 


519 
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FRAGMENTS 


FRAGMENTS 


ENDS 




0 ( G AAG A ) 


1 






(91.7) 




650 






650 


650 


1 


i 
i 
1 








5? 


( 8.3) 


650 


709 


j MNL 

; 


1 (CCTC) 


5 






(27.2) 


81 


274 






81 


193 








274 


1 / *t 


(OA C \ 


535 


709 








406 


132 


(18.6) 


274 


406 


i 
i 
i 






486 


81 


(11.4) 


1 


81 


t 






535 


80 


(11.3) 


406 


486 












( a q ) 

V O . j 1 


486 


535 


i 

. Mo 1 

1 


9 ( rfTNAGG ) 

£ \ 1 I 1 ! AVJVJ / 


1 






(73.9) 


185 


709 




185 


524 


i 








185 


(26.1) 


1 


185 


NCI 


1 (CCSGG) 


2 






(52.5) 




372 




372 


372 


1 








595 


223 


(31.5) 


372 


595 










lid 




595 


709 




i ( rrxTGG) 


1 






(68 ,0) 


227 


709 




227 


482 








001 




1 


227 


Nor' 




1 






(72.2) 


197 


709 




197 


512 










197 


(27.8) 


i 


197 


PST 


1 (CTGCAG) 


3 






(42.0) 




^ rs ft 

298 




298 


298 


1 








327 


261 


(36.8) 


448 


709 








448 


121 


(17.1) 


327 


448 












(41) 


298 


"KOI 


C A TT 




1 






(73.9) 


185 






185 


524 


709 










185 


(26.1) 


1 


185 


SAU 


3 A (GATC) 


1 


150 


559 


(78.8) 


150 


709 








150 


(21.2) 


1 


150 
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^^AGMENTS 
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ENDS 


.!SAU96J. (GGNCC) 

■ j 

!i 








(31.0) 










220 


257 


477 








165 


(23.3) 


92 


257 


!' 




?57 


137 


(19.3) 


572 


709 


!| 




d77 


95 


(13.4) 


477 


572 






3 / I 


63 


( 8.9) 


29 


92 








29 


( 4.1) 


1 


29 


J SCR Fl (CCNGG) 


5 






(48.5) 




344 




j *f *t 


344 


1 






110 
3 1 c 


172 


(24.3) 


372 


544 






3*t *t 


114 


(16.1) 


595 
-j j ^ 


709 






5 57 


38 


( 5.4) 


557 


595 


1 




3 7 3 


28 


( 3.9) 


344 


372 


! 






13 


( 1.8) 


544 


557 


SFA Nl (GATGC) 


1 






(72.8) 


193 


709 






516 








193 


(27.2) 


1 


193 


TTM1X1 1 


1 






(88.9) 


79 


709 


(GACNNNGTC) 




79 


630 






79 


(11.1) 


1 


79 



u 
u 
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The following do not appear: 





2 


AFL 


2 


AFL 3 


AHA 3 


APA 


1 


ASU 


2 


AVA 3 


AVR 2 


BAL 


1 


BAM 


HI 


BCL 1 


BGL 1 


BGL 


2 


BIN 


1 


BSSH 1 


BST E2 


CFR 


1 


CLA 


1 


ECO R5 


FNUD 2 


GDI 


2 


HAE 


1 


HGA 1 


HGI CI 


HGI 


Dl 


HGI 


J2 


HIND 3 


HP A 1 


KPN 


1 


MLU 


1 


MST 1 


NAE 1 


NAR 


1 


NDE 


1 


NRU 1 


NSP CI 


PVU 


1 


PVU 


2 


RRU 1 


RSA 1 


SAC 


1 


SAC 


2 


SAL 1 


SMA 1 


SNA 


1 


SPH 


1 


STU 1 


TAQ 1 


XBA 


1 


XHO 


1 


XHO 2 


XMA 3 


XMN 


JL 











50 60 
GTGTCCCACC CCACCCACAG 



110 120 
TCGTGGGGAC ACCAGAAGTC 

H 
I 
N 
2 

170 180 
CCAAGATGTA TAAAGGGTTC 



230 240 
CCCCCGCCAT GGAGAGTGTC 
N 
C 
0 
1 



10 



20 



30 



40 



ia'i 
i ft 

-Mi 
Vwj 

S& 



GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT 

SH 
AA 
UE 
13 

70 80 90 100 

ACGGCCTTCT GCAATTCCGA CCTCGTCATC AGGGCCAAGT 
H T M SH 

A T N AA 

E H L UE 

3 11 13 



130 

AACCAGACCA 



140 

CCTTATACCA 



160 

ATCAAGATGA 



150 

GCGTTATGAG 
S 
A 
U 
A 



190 200 210 220 

CAAGCCTTAG GGGATGCCGC TGACATCCGG TTCGTCTACA 
SD FS FN F H A 

AD OF NS OP C 

UE KA UP K A C 

11 11 12 12 1 
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! 250 


260 


270 


280 


290 


300 


TGCGGATACT 


TCCACAGGTC 


CCACAACCGC 


AGCGAGGAGT 


TTCTCATTGC 


TGGAAAACTG 


t . ■ 


A 


a 

o 


M 




P 


1 


V 


B 


N 




S 


j 
i 


A 


V 


L 




T 


i 

t 

i 


2 


1 


1 




1 


310 


320 


330 


340 


350 


360 


CAGGATGGAC 


TCTTGCACAT 


CACTACCTGC 


AGTTTCGTGG 


CTCCCTGGAA 


CAGCCTGAGC 


F H 




P 




B 


DAD 


0 I 




S 




S 


D L D 


K N 




T 




T 


E U E 


1 1 




1 




1 


111 



: J: 



370 


380 


390 


400 


410 


420 


TTAGCTCAGC 


GCCGGGGCTT 


CACCAAGACC 


TACACTGTTG 


GCTGTGAGGA 


ATGCACAGTG 


A 0 Hit 


ri n 






M 




L D AH 


c ? 






N 




U E EA 


I H 






L 




1 1 '21 


1 1 






1 




430 


440 


450 


460 


470 


480 


TTTCCCTGTT 


TATCCATCCC 


CTGCAAACTG 


CAGAGTGGCA 


CTCATTGCTT 


GTGGACGGAC 




r 


o 
r 






A 




0 


s 






V 




K 


T 






A 




1 


1 






2 


490 


500 


510 


520 


530 


540 


CAGCTCCTCC 


AAGGCTCTGA 


AAAGGGCTTC 


CAGTCCCGTC 


ACCTTGCCTG 


CCTGCCTCGG 


A M 






H 




MA 


L N 






P 




NV 


U L 






H 




LA 


1 1 






1 




11 


550 


560 


570 


580 


590 


600 


GAGCCAGGGC 


TGTGCACCTG 


GCAGTCCCTG 


CGGTCCCAGA 


TAGCCTGAAT 


CCTGCCCGGA 


B 


H B 




A 


H 


NH 


S 


G S 




V 


I 


CP 


T 


I T 




A 


N 


IA 


1 


1 1 




2 


1 


12 


610 


620 


630 


640 


650 


660 


GTGGAAGCTG 


AAGCCTGCAC 


AGTGTCCACC 


CTGTTCCCAC 


TCCCATCTTT 


CTTCCGGACA 


A 








M 


H 


L 








B 


P 


U 








0 


A 


1 








2 


2 
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670 680 690 700 

ATGAAATAAA GAGTTACCAC CCAGCaAaAA AAAAAAGGAA TTC 

• E 
C 
0 

1 



ij EXAMPLE 2-EXPRESSION OF COLLAGENESE INHIBITOR IN E. COLI 

! ! . 

ij In this Example, a preferred method of coupling a preferrec 

'portable DNA sequence to' the 5' end of the cloned cDNA is set 
i> forth. This involves making a nucleolytic cleavage at a speci- 
fied point within the coding sequence and reconstructing the de- 

•sired portion of the coding sequence by means of synthetic 

. j 

''oligonucleotides in a manner that allows its excision and recom- 
bination (i.e., by incorporating useful restriction sites). 

Trimming the 5' end of the coding region will be 
accomplished by synthesizing both strands of the DNA extending 
from the Tthllll site in the 5' direction and ending in a BamHI 
overhang. This synthetic oligonucleotide, referred to as FIBAC 
A, has the following features: 

(1) Codon selection has been biased toward those most fre- 
quently found in the genes of highly expressed bacten 
al proteins; 

(2) A methionine codon from which to initiate translation 
has been provided immediately upstream from the 
cysteine which begins the coding region of human pro- 
cessed FIBAC; 

(3) The spacing of the BamHI site to the methionine codon 
is such that when cloned into p(JC8 , the coding region 
of FIBAC will be in-frame with the 5' end of the beta- 
galactosidase gene; 
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An in-frame stop codon ai^^hine Dalgarno sequence are 
also presented. Translate, of this frame for the 
amino terminal portion of the beta-galactosidase is 
terminated at the TAA 'codon, and translation o. fibac 
should be initiated at the following ATG; 

(5) Codons have b en selected to create a Hgj.AI site begin- 
ning with the G in the FIBAC initiation codon; and 

(6) There is a Pvul site separated by one base from the 3' 
end of the BamHI sequence. 

The structure of FIBAC A Jrs 

GA TCC GCG ATC GGA GTG yfAA GAA ATG TGC ACT 
G CGC TAG CCT CAJ? ATT CTT TAC ACG TGA 



TGC GTT CCG CCG 
ACG CAA GGC GGC 



CCG CAG ACT GCT TTC 
GGC GTC TGA CGA AAG 



TGC AAC TCT GAC C 
FIBA™ ^synthesized using the ABI DNA synthesizer (Foster 
City, California) as a series of four component oligonucleotides. 
Component oligonucleotide FAl is: 



component, uay/mui-icu!.^ » ~- — 

GATCC GCGAT C^GAG TGTAA GAAAT GTGCA CTTGC 

Component oligonucleotide FA2 is: 

GGAACG CAAGT GCA/A TTTCT TACAC TCCGA TCGCG 

ComDonent oligonucleotide FA3 is: 

GTTC CGCCG CATCC/GCAGA CTGCT TTCTG CAACT CTGAC C 

Component ol igonudleotide FA4 is: 

AGGTC AGAGT TGCA/ AAAGC AGTCT GCGGA TGCGG C 



The remainder of the coding portion of the FIBAC gene is 
isolated as the 3' Tthllll to EcoRI fragment generated by a dou- 
ble digest of pUC9-F5/237P10 with these enzymes. 

A synthetic linker is made to couple the 3' end of the 
Tthllll to EcoRI fragment' to a Sail site. These oligonucleotide 
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vil^be designed to recreate the I site and destroy the Ecc RI 
site.w.The linker is comprised of the oligonucleotides linker a: 
and linker A2 . 

Oe<i jp who) 

Linker Al^ is : AATTGGCAG 

CS6Q if i* o; 
Linker A2 is: TCGACTGCC 
A 

These oligonucleotides and oligonucleotides FA1-FA4 are 
kinased separately and annealed in equal molar ratios with the 
Tthllll to EcoRI 3* end of the cDNA and BamHI/Sall cut mpl9RF 
DNA. The ligated DNA is used to transfect JM105. Plaques are 
picked by their color in the presence of IPTG and X-gal and by 
'hybridization to oligonucleotide FA2. Several positive plaques 
are to be sequenced. Those containing the designed sequence are 
subcloned into BamHI/Sall digested pUC8. Translation of the 
FIBAC gene in this construct is coupled to translation initiatec 
for beta-galactosidase. This expression vector is referred to a 
pUC8-Fic. 

Coupling translation of FIBAC to translation initiated for 
other highly expressed proteins is similarly arranged. For exan 
pie, a portion of the OmpA gene which contains the Shine-Dalgarr 
and initiator methionine sequences has been synthesized. This 
sequence encodes the entire signal peptide of OmpA protein and 
! had convenient restriction sites, including those for EcoRI, 
EcoRV, Pvu l , and Stu I . 
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kqtrenee of the sense strd 



' 10 


20 


30 


40 


5C 


6 C 


1 GAATTCGATA 


TCTCGTTGGA 


GATATTCATG 


ACGTATTTTG 


GATGATAACG 


AGGCGCAAA^ 


j E T E 






F 


M 


H 


I L A u 






o 


N 


H 


In 0 0 






K 


L 


A 


; 1 15 






1 


1 


1 


70 


80 


90 


100 


110 




; AATGAAAAAG 


ACAGCTATCG 


CGATCGCAGT 


GGCACTGGCT 


GGTTTCGCTA 


CCGTA 



A 


NF 


PS 


L 


RN 


VA 


U 


UU 


UU 


1 


12 


1A 


130 







120 

GCGCA GGCCTCTGGT AAAAGCTT 
H S H M HA 
H TAN IL 
A U E L NU 
113 1 31 

This sequence is hereinafter referred to as OmpA leader. 

Coupling the translation of FIBAC to OmpA is accomplished by cut- 
ting the pUC8-Fic with Pvu l and Sai l and isolating the coding re- 
gion. This, together with the Eco RI to Pvu l fragment isolated 
from OmpA leader, will be cloned into EcoRI /Sal I -cut pUC8 . As in 
the prior example, transcription is driven by the lac promoter 
and regulated by the lac I gene product at the lac operator. 
This FIBAC expression vector is referred to as pUC8-F/OmpAic . 

To effect the translocation of FIBAC out of the inner cell 
membrane, an appropriate leader sequence is added to the amino 
terminus of FIBAC. The protein thus produced will be translo- 
cated and processed to yield the mature form. 
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'i ft, ,«.et such . translocation. FIBAC gen. encoding the 

!! sig^Tpeptide of the S^U O-p* protein continuous with t,e 

iistructura! region of FIBAC is created. This particular FIBAC^ 

ge n« necessitates having in frame stop codons at the 5' end = . 

th . FIBAC c=din 9 reoion chanced. To accomplish this, the por-.: = , 

oC th, 5' coding reoion fro. pUCB-Fic that extends fro. the Hc^- 

".It. to the Hco! site is isolated. Upstream sequences are 

synthesized as a linker having cohesive ends from B^HI and 

Hg_iAI and contains an internal St*I site. This is synthesized 

as two oligonucleotides linker Bl and linker B2. 
as two * f>w;vC) 

Linker Bl.is: GATCCCAGGCCTGCA 

",<-n IOC". • >aJ 
Linker B20S: GGCCTGG 

Linkers Bl and B2 are kinased separately and annealed in 
eq ual molar ratios with the HgiAI to See, fragment described 
ab0 ve and B^HI/^. cut puCS-Fic. The resulting construct as 
the coding sequence of FIBAC in frame with the translation of th- 
amino terminus of heta-galactos idase . Translation of tM. ..- 
qu .nce forms a fusion protein with FIBAC. This plasmid 

referred to as pUCB-Ff . 

Attaching the OmpA leader sequences to the coding region 
FIBAC is accomplished b y Hgating EcoRI/StuI cut pUCB-Ff with ar 
excess of the purified ^ to M . <»— « of OmpA leader 
Allowing transformation, plasmids from several colon.es < 
ch aracterUed b y hyhridUation. Those that have incorporated t. 
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jOmpA leader fragment are characterized further to verify the 

I 

jstructure. This plasmid, pUC8-F OmpAl, will direct the synthesis 

•!of a fusion protein beginning in the signal peptide of the 

i! 

i. coli OmpA protein and ending in human FIBAC. The signals present 
in the OmpA portion of the protein effect the protein's export 

■ i 

from the cytoplasm and appropriate cleavage from the primary 
structure of FIBAC, 

j If the efficiency of expression were to be compromised by 

the sequence of the leader peptide or its combination with FIBAC 
either at the protein or at the nucleic acid level, the gene 
could be altered to encode any of several known E, coli leader 
^ f sequences. 

^ Transcription of all of the genes discussed is effected by 



the lac promoter. As in the case of initiation sites for trans- 



it lation, the promoter and operator region of the gene may be 

m 

H interchanged. FIBAC may also be expressed from vectors incor- 



porating the lambda Pl promoter and operator (Ol), and the hybric 
promoter operator, Tac as described in Amann, E. , Brosius, J., 
and Ptashne, M. Gene 25:167-178 (1983), specifically incorporatec 
herein by reference. Excision of those portions of the gene 
including ribosome binding site structural region and 3' non- 
translated sequences and insertion in alternate vectors contain- 
ing the Pl or Tac promoter makes use of the unique restriction 
sites that flank these structures in pUC8-F/OmpAic and 
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! pUcJ^OmpAl . Insertion of the EcJ^to Sail fragment from ei- 
ther into similarly digested plasmid pDP8 effects transcription 
of these genes directed by the lambda P L promoter. Transcrip- 
tional regulation would be temperature sensitive by merit of the 
cl857 mutation harbored on this same plasmid. 

Putting similar gene fragments into the transcription unit 
of the Tac promoter will be accomplished by first isolating the 
EcoRV to San fragment. This, together with the synthetic Tac 
promoter sequence which is flanked by BamHI and PyuII sites and 
which contains the lac operator will be inserted into the BamHI 
to Sail sites of pBR322 or preferably derivatives. The deriva- 
tives in this case refer to constructs containing either the lad 

gene or the 1^ gene. 

Expression of FIBAC in host microorganisms other than 

*=tf 

Escherichia is considered. Yeast and bacteria of the genera 

ill 

N Bacillus , Pseudomonas , and Clostridium may each offer particular 

y< advantages. The processes outlined above could easily be adaptec 

to others. 

jjj in general, expression vectors for any microorganism will 

¥ " embody features analogous to those which we have incorporated in 

the above mentioned vectors of E. coli. In some cases, it will 
be possible to simply move the specific gene constructs discussec 
above directly into a vector compatible with the new host. In 
others, it may be necessary or desirable to alter certain 
operational or structural elements of the gene. 
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i| EXAMPLE 3 

;l 

The human collagenase inhibitor may be readily puri.fied 
after expression in a variety of microbes. In each case, the 
spectrum of contaminant proteins will differ. Thus, appropriate 
purification steps will be selected from a variety of steps 
/already known to give a good separation of the human collagenase 
■inhibitor from other proteins and from other procedures which are 

■! likely to work, 

■ i 

If the inhibitor is not secreted from the microbes, it may 
•form inclusion bodies inside the recombinant microbes. These 
bodies are separated from other proteins by differential 
centrifugat ion after disruption of the cells with a French Press. 
The insoluble inclusion bodies are solubilized in 6 M guanidine 
fil hydrochloride or 8 M urea, and the inhibitor protein is more com- 

|ij pletely solubilized by reaction of its cysteines with sodium 

sulfite. At any time subsequent to this step, the cysteines are 
li converted back to their reduced form with di thiothreitol . Once 

the inhibitor protein is solubilized from inclusion bodies, 
immunoaf f ini ty chromatography using antibodies raised against the 
unfolded inhibitor are used .for purification before refolding. 

The inhibitor can be refolded according to the protocol men- 
tioned in Example 6, infra . After refolding of the inhibitor, or 
if the inhibitor is secreted from the microbes, purification from 
other proteins is accomplished by a variety of methods. Initial 
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isteps include ultrafiltration through a 50 K dalton cutoff mem- 
jjbran or ammonium sulfate fractionation. Other useful methods 
.include, but are not limited to, ion-exchange chromatography, gel 

^filtration, hepar in-sepharose chromatography, reversed-phase 

i! 

; chromatography, or zinc-chelate chromatography. All of these 

; i 

•steps have been successfully used in purification protocols. 

, i 

Additional high resolution steps include hydrophobic interaction 
chromatography or imxnunoaf f inity chromatography. After purifica- 
tion, the metalloproteinase inhibitor is preferably at least 
90-95% pure. 

EXAMPLE 4 



Purification of Human Collagenase Inhibitor from Human 
^ Amniotic Fluid 

n^i i ... 

?1 Human amniotic fluid obtained from discarded amniocentesis 

M samples was pooled and 6 liters were subjected to ultrafiltration 

iil 

'"I through a 100 kD MW cutoff filter, obtained from Millipore Corpo- 

M* ration, in a Millipore Pellicon Cassette System. The eluate was 

M concentrated through a 10 kD cutoff filter, obtained from 

Millipore Corporation, then through an Amicon PM-10 membrane. 
Aliquots (10 ml) of concentrated amniotic fluid were eluted 
through a 2.5 x 100 cm column of Ultrogel AcA54, obtained from 
LKB Corporation, which was equilibrated with pH 7.6, 0.05 M 
hepes, 1 M sodium chloride, 0.01 M calcium chloride, and 0.02% 
sodium azide (all chemicals were obtained from Sigma Chemical 
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)^^ny) . Fractions containing t^^i 



;j Company), Fractions containing the inhibitor were collected and 

!j 

; pooled, dialyzed against pH 7.5, 0,025 M Hepes buffer containing 
!! 0.01 M calcium chloride and 0.02% sodium azide, and loaded onto a 
ij 1.5 x 28 cm heparin-sepharose CL-6B (obtained from Pharmacia, 
Inc.) column equilibrated with the same buffer. This column was 
rinsed with 1 liter of the above buffer and eluted with a linear 
gradient of 0-0.3 M sodium chloride. The fractions from the 
largest peak of inhibitor activity, eluting at about 0.1-0.15 M 
sodium chloride, were pooled, concentrated to 1 ml, and loaded 
onto a Synchropak rp-8 reverse phase HPLC column equilibrated 
with 0.05% trif luoroacet ic acid (Aldrich Chemical Company). The 
column was eluted with a linear gradient of 0-40% acetonitrile 
(J. T. Baker Chemical Company) at 1/2% per minute. All fractions 
f!| were immediately dried in a Savant speed-vac concentrator to re- 



iij move acetonitrile, and redissolved in pH 7.5, 0.1 M Hepes before 

yt 

assay. The inhibitor eluted between 32-38% acetonitrile. Frac- 



tions containing the inhibitor were pooled, and 100 ul aliquots 
were eluted over a Bio-rad biosil-TSK 250 HPLC gel filtration 
column. The pooled peaks of inhibitor activity contained 0.1 mg 
of inhibitor, which was over 95% pure as judged by SDS- poly- 
acrylamide gel electrophoresis. 
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EXAMPLE 5 



jj Purification of Human Fibroblast Collagenase Inhibitor from 
Human Embryonic Skin Fibroblast Serum-Free Medium 

Human embryonic skin fibroblasts were grown in serum-free 
tissue culture medium. Ten liters of this medium were collected, 
dialyzed against pH 7.5, 0.02 M hepes buffer containing 0.02% so- 
dium azide and 0.01 M calcium chloride, and applied to a 2.8 x 48 
cm column of hepar in-sepharose CL-6B (Pharmacia, Inc.) 
equilibrated with the same buffer. The column was rinsed with 2 
liters of this buffer and was then eluted with linear gradient of 
0-0.3 M sodium chloride contained in this buffer. The fractions 
obtained were tested for the presence of inhibitor by their abil- 
$ ity to inhibit human fibroblast collagenase. The fractions cor- 

IC| responding to the peak of activity were those obtained near 0.15 

hj M sodium chloride. These fractions were concentrated to about 5 

ml. by ultrafiltration through an Amicon YM10 filter and the con- 
centrate was applied in four separate runs to a 250 x 4.1 mm 
Synchropak rp-8 reverse phase HPLC column, equilibrated with 1% 
trif luoroacet ic acid. The column was eluted with a 0-60% linear 
gradient of acetonitrile in 0.1% tr if luoroacetic acid. The gra- 
dient was run at 1/2% acetontrile per minute. The inhibitor 
eluted in two sharp peaks between 26-29% acetonitrile. All frac- 
tions were immediately dried in a Savant speed-vac concentrator, 
redissolved in pH 7.5, 0.1 M Hepes, and assayed. At least 1.2 mg 
of collagenase inhibitor was recovered, which was 90-95% pure. 
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This material gives a single band when run on a 17.5% reducing 
SDS gel. After carboxymethylat ion of the cysteines and elution 
through the same rp-8 column under identical conditions, the in- 
hibitor is suitably homogenous for protein sequencing. 

EXAMPLE 6 

It is contemplated that the human collagenase inhibitor can 
be readily refolded into its native structure from its denatured 
state after expression of its gene in a microbe and separation of 
the collagenase inhibitor from most of the other proteins pro- 
duced by the microbe. By analogy to the conditions necessary for 
the refolding of other disulf ide-contanning proteins as set forth 
by Freedman, R. B. and Hillson, D. A., in "Formation of Disulfide 
Bonds In: The Enzymoloqy of Post-Translat ional Modification of 
Proteins , Vol. 1, R. B. Freedman and H. C. Hawkins, eds . , pp. 
158-207 (1980), specifically incorporated herein by reference, 



M 

la 
W 

b refolding of the human collagenase inhibitor should occur in so- 

M: lutions with a pH of 8.0 or greater. At this pH, the cysteines 



of the protein are partially ionized, and this condition is nec- 
essary for the attainment of native disulfide bond pairings. The 
inhibitor concentration should be relatively low, less than 0.1 
mg/ml, to minimize the formation of intermolecular disulfide- 
linked aggregates which will interfere with the refolding pro- 
cess . 
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Since the stability of the refolded (native) disulfide bond- 

ii 

i ed structure relative to the unfolded (reduced) structure depends 

ii 

■on both the solution oxidation-reduction potential and the con- 
Icentrations of other redox-active molecules, it is contemplated 
that the redox potential should be buffered with a redox buffer 
^giving a potential equivalent to a reduced: oxidized glutathione 
'ratio of 10. The preferred concentration range of reduced 

glutathione would be 0,1-1. 0 mM. At higher concentrations, mixed 

li 

'disulfides will form with protein, reducing the yield of the 
refolded (native) structure. The relative stabilities of the 
unfolded protein and the native structure, and thus the rate and 
yield of refolding, will also depend on other solution variables, 
such as the pH, temperature, type of hydrogen-ion buffer, ionic 
strength, and the presence or absence of particular anions or 



$ cations as discussed in Privalov, P. L., "Stability of Proteins, 

M Small Globular Proteins," in Advances in Protein Chemistry , Vol. 



33, pp. 167-236, (1979), specifically incorporated herein by ref- 
erence. These conditions vary for every protein and can be de- 
ggj termined experimentally. It is contemplated that addition of any 

molecule that strongly prefers to bind the native (as opposed to 
the unfolded) structure, and which can be readily separated 
afterwards from the native (refolded) protein, will increase not 
only the yield but the rate of re-folding. These molecules in- 
clude monoclonal antibodies raised against the native structure, 
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land other proteins which tightly bind the native collagenase in- 
jhibitor, such as the mammalian enzymes collagenase or gelatinase 

| Example 7 

! 

! JJue-seeofid-prei-e Trod aoquonc e ac sot — forth h e r e r 



m 



Mi 



10 



20 



30 40 50 6 

!GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAGAG AACCCACCAT GGCCCCCTT 

HH N S 



H 
A 
E 
3 



F 
N 
U 
1 



XB 
HI 
ON 
21 



AH 
EA 
21 



C 
0 

1 



A 

U 
1 



70 



80 90 100 110 12 

GACCCCTGGC TTCTGCATCC TGTTGTTGCT GTGGCTGATA GCCCCAGCAG GGCCTGCAC 

S H 
A A 



B 
S 
T 
1 



SF 

FO 
AK 
11 



U E 
1 3 



130 140 150 160 170 18C 

TGTGTCCCAC CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCAT CAGGGCCAAC 

T M SH 



H 
A 
E 
3 



T 
H 
1 



N 

L 
1 



AA 

UE 
13 



190 200 210 220 230 24C 

TTCGTGGGGA CACCAGAAGT CAACCAGACC ACCTTATACC AGCGTTATGA GATCAAGATC 

H S 

1 A 
N U 

2 A 
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250 


260 


270 




280 


290 


ACCAAGATGT 


ATAAAGGGTT 


CCAAGCCTTA 


GGGGATGCCG 


CTGACATCCG 






SD 


FS 


FN 


F H 






AD 


OF 


NS 


O P 






UE 


KA 


UP 


K A 






11 


11 


12 


1 2 


310 


320 


330 




340 


350 


ACCCCCGCCA 


TGGAGAGTGT 


CTGCGGATAC 


TTCCACAGGT 


CCCACAACCG 


N 








A 


B 


C 








V 


B 


0 








A 


V 


1 








2 


1 






-68 









30C 



A 
C 
C 
1 



M 
N 
L 
1 



360 



1 1 

li 

ji J^u JOU J3U 400 410 42C 

ij 

i! 
II 



370 


380 




390 


*t\JU 


410 


TTTCTCATTG 


CTGGAAAACT 


GCAGGATGGA 


CTCTTGCACA 


TCACTACCTG 




P 


F 


H 




P 




c 


0 


I 




S 




1 


K 


N 




T 




1 


1 


1 




1 


430 












GCTCCCTGGA 


AC 











B 
S 
T 
1 

has the following restriction sites: 

# SITES FRAGMENTS FRAGMENTS ENDS 



ACC 1 (GTVWAC) 



1 



295 295 (68.3) 1 295 

q 137 (31.7) 295 432 



AVA 2 (GGRCC) 



338 338 (78.2) 1 338 

94 (21.8) 338 432 



$ 

- b J: 

W BBV 1 (GCTGC) 

N 1 

» 350 350 (81.0) 1 350 

M- 82 (19.0) 350 432 

M. 

U 



BIN 1 (GGATC) 



u 14 ( 3.2) 1 14 



14 418 (96.8) 14 432 



BST Nl (CCRGG) 



65 360 (83.3) 65 425 

425 65 (15.0) 1 65 

7 ( 1.6) 425 432 
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VI 

in 



DDE l-(CTNAG) 



FNU4H 1 (GCNGC) 



# 

1 



SITES 



FRAGMENTS 



FRAGMENTS SNDS 



; FOK 1 (GGATG) 



HAE 2 (PGCGCQ) 



HAE 3 (GGCC) 



HHA 1 (GCGC) 



HINC 2 (GTQPAC) 



X D / 


OCT 

id/ 




1 


267 




165 


(38.2) 


267 


432 


Q 
0 


269 


(62,3) 


8 


277 


111 


82 


/in f\ \ 

(19.0) 


350 


432 


J DU 


73 


(16.9) 


^ ^ 
277 


n c rt 

350 




o 
0 


( 1.9) 


i 


8 


76 


197 


(45.6) 


76 


273 


tli 


99 


(22.9) 


IOC 

285 


384 


285 


76 


(17.6) 


1 


76 


384" 


48 


(11.1) 


384 


432 




12 


( 2.8) 


273 


285 


19 


413 


( 95. 6 ) 


19 


432 




1 Q 

19 


i a A \ 

( 4.4) 


1 


1 9 


1 


258 


(59.7) 


174 


432 


51 


60 


(13.9) 


51 


111 


111 

111 


50 


(11.6) 


1 


C 1 

51 


144 


33 


( 7.6) 


111 


144 


174 


30 


( 6.9) 


144 


174 




1 


( 0.2) 


1 


1 


20 


412 


(95.4) 


20 


432 




20 


( 4.6) 


1 


20 


199 


233 


(53.9) 


199 


432 




199 


(46.1) 


1 


199 
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jj # SITES FRAGMENTS FRAGMENTS ENDT 



:| HINF 1 (GANTC) 

; i 1 



! HPA 2 (CCGG) 



MNL 1 (CCTC) 



MST 2 (CCTNAGG) 



NCO 1 (CCATGG) 



389 389 (90.0) 1 389 

43 (10.0) 389 432 



288 288 (66.7) 1 288 

144 (33. 3) 288 432 



162 193 (44.7) 162 355 

355 162 (37.5) 1 162 

77 (17.8) 355 432 



266 266 (61.6) 1 266 

166 (38.4) 266 432 



$ "47 261 (60.4) 47 308 

'W 308 124 (28.7) 308 432 



47 (10.9) 1 47 



W NSP B2 (CVGCWG) 

SI 1 

C 278 278 (64.4) 1 278 

}* 154 (35.6) 278 432 

fa PST 1 (CTGCAG) 



SAU 1 (CCTNAGG) 



379 379 (87.7) 1 379 

408 29 ( 6.7) 379 408 

24 ( 5.6) 408 432 



266 266 (61.6) 1 266 

166 (38.4) 266 432 
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SAU 3 A- • ( GATC ) 



# SITES 
2 

14 

231 



SAU 9 6 1 (GGNCC) 



51 
110 
173 
338 



FRAGMENTS 



217 (50.2) 
201 (46.5) 
14 ( 3.2) 



165 
94 
63 
59 
51 



(38.2) 
(21.8) 
(14.6) 
(13.7) 
(11.8) 



FRAGMENTS ENDS 



14 231 
231 432 
1 14 



173 
338 
110 
51 
1 



338 
432 
173 
110 
51 



SCR Fl (CCNGG) 



65 
425 



D 
C4 



•U1 



Ml 



SFA Nl (GATGC) 



75 
274 



STY 1 (CCRRGG) 



47 
308 



TTH111 1 (GACNNNGTC) 



XHO 2 (PGATGQ) 



160 



13 



The following do not appear: 



AAT 2 
AHA 3 
AVA 1 



AFL 2 
ALU 1 
AVA 3 



360 
65 
7 



199 
158 
75 



261 
124 
47 



272 
160 



419 
13 



(83.3) 
(15.0) 
( 1.6) 



(46.1) 
(36.6) 
(17.4) 



(60.4) 
(28.7) 
(10.9) 



(63.0) 
(37.0) 



(97.0) 
( 3.0) 



AFL 3 
APA 1 
AVR 2 



65 
1 

425 



75 
274 
1 



47 
308 
1 



160 
1 



13 
1 



425 
65 
432 



274 
432 

75 



308 
432 
47 



432 
160 



432 
13 



AHA 2 
ASU 2 
BAL 1 
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j! BAM HI 


RAM 1 
DAM X 


RAM 0 




i j BGL 1 


RPT 0 


pew 1 
Don x 


uor x^oo 


|i nSSH 1-- 


DO 1 CjZ 


f PR 1 




i ECO Rl 








, HAE 1 




T AT 


n vj x v. x 


i; HGI DX 




utma 0 


HPA 1 


\ 1 HPn 1 


If DM 1 


rlD\J C 


1*1 l_j u X 


worn 1 

MST 1 


MATT 1 


MAR 1 


MP T 1 


NDE 1 


MrlCj -L 


MOT 1 


MRTT 1 
IN f\U X 


: Nor LI 


own i 
rvu x 


PVTT ? 
rvu ^ 


RRII 1 


OCX 1 

: RSA 1 


CAP 1 
oAL X 


Q AP 9 


^AT 1 

O £\Lj X 


SCA 1 


SMA 1 


SNA 1 


SNA Bl 


SPE 1 


SPH 1 


SSP 1 


STU 1 


: TAQ 1 


XBA 1 


XHO 1 


XMA 3 


; XMN 1 









The salient features of this cDNA are: 



f ; 1; 



ill 



hi 

p 



1. The coding strand is presented in the 5' to 
3' convention with the polyC tract at the 5' 
end. 

2. If the first G in the sequence GGC CAT CGC 
CGC is considered as nucleotide 1, then an 
open reading frame exists from nucleotide 1 
through nucleotide 432, which is the 3' end 
of this partial cDNA. 

3. The first methionine in this reading frame is 
encoded by nucleotides 49 through 51 and 
represents • the initiation site of transla- 
tion. 

4. The amino acid sequence prescribed by 
nucleotides 49 through 114 is not found in 
the primary structure of the mature protein, 
but it is the sequence of the leader peptide 
of human protein. 

5. The sequence of nucleotides 82 through 432 is 
identical to the sequence of nucleotides 
numbered 1 through 351 in the insert from 
the first preferred sequence of Example 1. 

6. The amino acid sequence of the mature protein 
displays two consensus sequences for sugar 
attachment. These sequences, -N-Q-T- pre- 
scribed by nucleotides 202 through 210 and 
-N-R-S- prescribed by nucleotides 346 
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through 354, are amino residues 30 
through 32 and 78 through 80, respectively, 
in the mature protein. Both sites are 
glycosylated in the human inhibitor protein. 

It will be apparent to those skilled in the art that various 

modifications and variations can be made in the processes and 

products of the present invention. Thus, it is intended that th< 

present invention cover the modifications and variations of this 

invention provided they come within the scope of the appended 

claims and their equivalence. 



0 



■it 
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